DETECTING HORMONALLY ACTIVE COMPOUNDS 



CROSS-REFERENCE TO RELATED APPLICATIONS 
The present application claims the priority of U.S. provisional patent application 
number 60/410,414 filed on September 13, 2002. 

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH 
The invention was made with U.S. government support under grant number P42 
ES07375 awarded by the National Institute of Environmental Health Sciences, and grant 
numbers CR826357-01-0, ID-5267-NTEX, and OD-5378-NTGX awarded by the 
Environmental Protection Agency. The U.S. government may have certain rights in the 
invention. 

SEQUENCE LISTING 
The present application contains a sequence listing on compact disc which is 
hereby incorporated herein by reference. The sequence listing file is entitled 5853- 
238.ST25.txt, contains 427 kilobytes and was created September 15, 2003. 

FIELD OF THE INVENTION 
The invention relates to the fields of molecular genetics, endocrinology, and 
toxicology. More particularly, the invention relates to compositions and methods for 
detecting androgenic/estrogenic agents in the environment and screening candidate 
agents for androgenic/estrogenic activity. 

BACKGROUND OF THE INVENTION 
The last decade saw the emergence of the field of endocrine disruption after it was 
discovered that a variety of anthropogenic chemicals act as weak estrogens. Through 
their interaction with estrogen receptors (ERs), these endocrine-disrupting compounds 
(EDCs) can alter normal expression of gene products and proteins at critical times during 
development and reproduction. Environmental contamination with EDCs is therefore a 
serious concern. 

In an effort to detect EDCs in environmental samples, a number of methods have 
been developed including both in vitro and in vivo assays. Available in vitro assays 
include those based on hormone receptor-ligand binding, cell proliferation, and reporter 
gene expression. Although these are relatively inexpensive and amenable to high 
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throughput applications, they provide only limited information about how EDCs affect 
animals in the environment (see, e.g., Zacharewski T. Environ. Sci. Technol. 31:600-623, 
1997; Baker V.A. Toxicol In vitro 15:413-419, 2001). In vivo exposure assays, on the 
other hand, provide useful information about whole animal responses to EDCs, but can 
be more cumbersome and expensive than in vitro assays. Moreover, such assays do not 
provide information about the molecular mechanisms underlying EDC-mediated changes 
in the animals. 

SUMMARY 

The invention is based on the discovery of a large number of sheepshead minnow 
(SHM) and largemouth bass (LMB) genes that are up-regulated or down-regulated in 
tissues that have been exposed to an estrogenic or androgenic agent. Thus, whether an 
environmental sample contains an estrogenic or androgenic agent can be determined by 
examining a fish (or biological sample obtained from the fish) that was exposed to the 
sample (e.g., a lake or river) for modulation of expression of these genes. A finding that 
these genes were modulated in the test fish compared to a control fish not exposed to the 
sample (or an estrogenic or androgenic agent) indicates that the sample contains an 
estrogenic or androgenic agent. It was also discovered that different classes of 
estrogenic or androgenic agents modulated expression of the genes in different patterns 
depending on the class or mechanism of action of the estrogenic or androgenic agent. 
Thus, the invention can be used to discern that a particular type of estrogenic or 
androgenic agent is present in the sample. Based on these discoveries, a screening assay 
to characterize an unknown molecule's hormonal (e.g., estrogenic or androgenic) activity 
was developed wherein a fish, fish tissue or fish cell is exposed to a test substance and the 
effect of the substance on gene expression is compared to known patterns of gene up- or 
down-regulation. On this basis, the agent can be classified as estrogenic or androgenic 
and is thus determined to be hormonally active. 

Accordingly, the invention features a method for detecting the presence of an agent 
having estrogenic or androgenic activity in a sample (e.g., a water sample). The method 
includes the steps of: (A) providing at least one (e.g., at least 2, 3, 4, 5, 10, 25, 100) 
fish cell which was exposed to the sample; (B) analyzing the at least one fish cell for 
expression of at least one gene wholly or partially encoded by a nucleotide sequence of 
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SEQ ID NOs: 1-560; and (C) comparing the expression of the at least one gene in the cell 
compared to the expression of the at least gene in a control cell not exposed to the sample 
or an agent having estrogenic or androgenic activity. A difference in the expression of 
the at least one gene in the at least one fish cell compared to the expression of the at least 
one gene in the control cell indicates that the sample contains an agent having estrogenic 
or androgenic activity. 

In the method, the fish cell can be a large mouth bass cell or a sheep's head 
minnow cell. It can also be one obtained from a fish that had been exposed to the 
sample. 

Also in the method, the step of analyzing the at least one fish cell for expression 
of at least one gene (e.g., at least 2, 3, 4, 5, 10, 25, 100) might involve isolating RNA 
transcripts from the at least one cell, and the step of analyzing the at least one fish cell for 
expression of at least one gene can include contacting the isolated RNA transcripts or 
nucleic acids derived therefrom using the isolated RNA transcripts as templates with at 
least one probe (e.g., at least 2, 3, 4, 5, 10, 25, 100) that hybridizes under stringent 
hybridization conditions to at least one nucleotide sequence of SEQ ID NOs: 1-560. 

The probe can be immobilized on a substrate such as nylon, nitrocellulose, glass, 
and plastic. It can be on conjugated with a detectable label. In one variation of the 
method of the invention, the isolated RNA transcripts or nucleic acids derived therefrom 
are conjugated with a detectable label. 

The method of the invention might also include analyzing the control cell not 
exposed to the sample or an agent having estrogenic or androgenic activity for expression 
of at least one gene wholly or partially encoded by a nucleotide sequence of SEQ ID 
NOs: 1-560. In this version of the method, the step of analyzing the control cell for 
expression of at least one gene can include isolating RNA transcripts from the control cell 
and contacting the isolated RNA transcripts or nucleic acids derived therefrom using the 
isolated RNA transcripts as templates with at least one probe (e.g., at least 2, 3, 4, 5, 10, 
25, 100) that hybridizes under stringent hybridization conditions to at least one nucleotide 
sequence (e.g., at least 2, 3, 4, 5, 10, 25, 100) of SEQ ID NOs: 1-560. Also in this 
version of the method, the RNA transcripts or nucleic acids derived therefrom isolated 
from the at least one fish cell can be conjugated with a first detectable label and the RNA 
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transcripts or nucleic acids derived therefrom isolated from the control cell are 
conjugated with a second detectable label differing from the first detectable label. 

For example, the method can include isolating RNA transcripts from the at least 
one fish cell and contacting the RNA transcripts isolated from the at least one fish cell or 
nucleic acids derived therefrom using the RNA transcripts isolated from the at least one 
fish cell as templates with at least one molecule that hybridizes under stringent conditions 
to at least one nucleotide sequence of SEQ ID NOs: 1-560. The at least one probe can 
be conjugated with a first detectable label and the at least one molecule can be conjugated 
with a second detectable label differing in chemical structure from the first detectable 
label The step of comparing the expression of the at least one nucleic acid in the cell 
compared to the expression of the at least one nucleic acid in a control cell not exposed to 
the sample or an agent having estrogenic or androgenic activity may be performed by 
quantifying the amount of first detectable label associated with the RNA transcripts 
isolated from the control cell or nucleic acids derived therefrom, and quantifying the 
amount of second detectable label associated with the RNA transcripts isolated from the 
at least one fish cell or nucleic acids derived therefrom. 

An additional variation of the method of the invention also includes the steps of 
contacting the fish with the sample; and isolating the at least one fish cell from the fish 
contacted with the sample. 

In another aspect, the invention features a method for determining whether an 
agent has estrogenic, anti-estrogenic, androgenic or anti-androgenic activity. This 
method includes the steps of: providing at least one fish cell; contacting the at least one 
fish cell with the agent; analyzing the at least one fish cell for expression of at least one 
gene wholly or partially encoded by a nucleotide sequence of SEQ ID NOs: 1-560; and 
comparing the expression of the at least one gene in the cell compared to the expression 
of the at least one nucleic acid in a control cell not exposed to the sample or an agent 
having estrogenic or androgenic activity. A difference in the expression of the at least 
one nucleic acid in the at least one fish cell compared to the expression of the at least one 
nucleic acid in the control cell indicates that the agent has estrogenic, anti-estrogenic, 
androgenic, or anti-androgenic activity. 
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Yet another aspect of the invention is a substrate having immobilized thereon at 
least one (e.g., at least 2, 3, 4, 5, 10, 25, 100) nucleic acid comprising a nucleotide 
sequence of SEQIDNOs: 1-560 and complements thereof. 

Unless otherwise defined, all technical terms used herein have the same meaning 
as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Commonly understood definitions of molecular biology terms can be found in 
Rieger et al., Glossary of Genetics: Classical and Molecular, 5th edition, Springer- 
Verlag: New York, 1991; and Lewin, Genes V, Oxford University Press: New York, 
1994. Commonly understood definitions of microbiology can be found in Singleton and 
Sainsbury, Dictionary of Microbiology and Molecular Biology, 3rd edition, John Wiley 

& Sons: New York, 2002. 

By the term "gene" is meant a nucleic acid molecule that codes for a particular 
protein, or in certain cases a functional or structural RNA molecule. 

As used herein, a "nucleic acid" or a "nucleic acid molecule" means a chain of 
two or more nucleotides such as RNA (ribonucleic acid) and DNA (deoxyribonucleic 
acid). A "purified" nucleic acid molecule is one that has been substantially separated or 
isolated away from other nucleic acid sequences in a cell or organism in which the 
nucleic acid naturally occurs (e.g., 30, 40, 50, 60, 70, 80, 90, 95, 96, 97, 98, 99, 100% 
free of contaminants). The term includes, e.g., a recombinant nucleic acid molecule 
incorporated into a vector, a plasmid, a virus, or a genome of a prokaryote or eukaryote. 
Examples of purified nucleic acids include cDNAs, fragments of genomic nucleic acids, 
nucleic acids produced by polymerase chain reaction (PCR), nucleic acids formed by 
restriction enzyme treatment of genomic nucleic acids, recombinant nucleic acids, and 
chemically synthesized nucleic acid molecules. A "recombinant" nucleic acid molecule 
is one made by an artificial combination of two otherwise separated segments of 
sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of 
nucleic acids by genetic engineering techniques. 

As used herein, "protein" or "polypeptide" are used synonymously to mean any 
peptide-linked chain of amino acids, regardless of length or post-translational 
modification, e.g., glycosylation or phosphorylation. 
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By the term "estrogenic" is meant acting to produce the effects of an estrogen. An 
"estrogenic agent" and an "estrogen mimic" is a substance that acts to produce the effects 
of an estrogen. 

As used herein the term "androgenic" means acting to produce the effects of an 
androgen. An "androgenic agent" and an "androgen mimic" is a substance that acts to 
produce the effects of an androgen. 

When referring to hybridization of one nucleic acid to another, "low stringency 
conditions" means in 10% formamide, 5X Denhardt's solution, 6X SSPE, 0.2% SDS at 
42° C, followed by washing in IX SSPE, 0.2% SDS, at 50° C; "moderate stringency 
conditions" means in 50% formamide, 5X Denhardt's solution, 5X SSPE, 0.2% SDS at 
42° C, followed by washing in 0.2X SSPE, 0.2% SDS, at 65° C; and "high stringency 
conditions" means in 50% formamide, 5X Denhardt's solution, 5X SSPE, 0.2% SDS at 
42° C, followed by washing in 0.1X SSPE, and 0.1% SDS at 65° C. The phrase 
"stringent hybridization conditions" means low, moderate, or high stringency conditions. 

Although methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, suitable methods and 
materials are described below. All publications, patent applications, patents and other 
references mentioned herein are incorporated by reference in their entirety. In the case of 
conflict, the present specification, including definitions will control. The particular 
embodiments discussed below are illustrative only and not intended to be limiting. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a series of macroarrays demonstrating gene expression profiles from 
SHM exposed to E 2 , 17a-ethynyl estradiol (EE 2 ), diethylstilbestrol (DES), para- 
nonylphenol (pNP), methoxychlor (MXC), endosulfan (ES) or untreated control fish. 
Three separate fish were used for each treatment. 

FIG. 2 is two graphs showing quantification of the E 2 , EE 2 , DES, pNP, MXC, ES 
and control arrays for SHM. Panel A is a plot of the mean ±SEM intensity values for 
each of the cDNA clones arranged in order of their expression. Panel B is a plot of the 
mean intensity values for each of the cDNA clones for E 2 , EE 2 , DES, pNP, ES, or MXC 
divided by the mean intensity values of the respective cDNA clones for untreated control 
fish. Any clones above the line labeled 1.66 were considered up-regulated genes, any 
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clones below the line labeled 0.42 were considered down-regulated genes, and any clones 
between these lines were considered constitutive. Genes on the macroarray were 
designated as constitutive if their intensity values fell within the range of the mean plus 
one standard deviation of the highest and lowest values of the 1 1 clones that were used to 
normalize the data. 

FIG. 3 is a series of graphs plotting the quantification of the EE 2 dose response 
arrays for SHM. Each graph contains a plot of a gene whose expression levels 
significantly changed more then 2-fold at one or more of the three EE 2 concentrations 
compared to controls as revealed by one way analysis of variance (PO.05). AMBP= 
alpha- 1-microglobulin/bikunin precursor protein. The data on both axes are plotted using 
a logio scale. 

FIG. 4 shows arrays on a plot for control and E 2 -treated fish and the results from 
the array analysis. Panels A and B are arrays that were hybridized with RNA from 
control (Methylene glycol (TEG)-treated) and E 2 -treated SHMs, respectively. Panel C is 
a plot of the mean intensity value of each cDNA clone on the E 2 -treated blots (N=3) over 
the mean intensity value of each cDNA clone on the control (TEG treated) blots (N=2). 
The black circles in panel C represent the 17 cDNA clones that were identified by DD 
analysis to be constitutive. Any clones above the dotted gray line labeled 1.27 were 
considered E 2 up-regulated genes, any clones below the dotted gray line labeled 0.83 
were considered E 2 down-regulated genes, and any clones between the two gray dotted 
lines were considered constitutive genes. In panel C, a is transferrin, b is vitellogenin 
(Vtg ) p, c is ZP2, and d is vitellogenin a. There is a break in the graph of panel C from 2 
to 10 log (intensity) units. 

FIG. 5 is two graphs showing gene expression profiles from control and E 2 - 
treated male LMB. (A) shows the mean ±SEM intensity values for each of the cDNA 
clones arranged in order of their expression (black circles are E 2 , gray circles are control); 
(B) illustrates the mean intensity values for each of the cDNA clones for E 2 divided by 
the mean intensity values of the respective cDNA clones from control fish. Any genes 
outside of the upper and lower solid gray lines in the figure change by more then two-fold 
and are considered to be up or down-regulated. Genes that exhibited a significant change 
in expression at P < 0.05 are shown by a double asterisk; whereas genes that exhibited a 
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significant change in expression at P < 0.1 are shown by a single asterisk (t-tests). Three 
separate fish were used for each treatment. Only genes that were found in at least one of 
the treatments to be at least three standard deviations from the mean of the 12 ribosomal 
protein (r-protein) genes used to normalize the data (0.98 ±0.41) are plotted. AR = 
5 androgen receptor, ER = estrogen receptor, and NADH = Nicotinamide Adenine 
Dinucleotide (reduced form). 

FIG. 6 is two graphs showing gene expression profiles from control and 4-NP- 
treated male LMB. The order of genes in this figure corresponds to the order in Fig. 5. 

FIG. 7 is two graphs showing gene expression profiles from control and p, p'- 
1 0 DDE treated male LMB. The order of genes in this figure corresponds to the order in 
Fig. 5. 

FIG. 8 is two graphs showing gene expression profiles from control and p, p'- 
DDE treated female LMB. The order of genes in this figure corresponds to the order in 
Fig. 5. 

1 5 FIG. 9 is a list of genes whose expression is increased or decreased more than 

two-fold following exposure of LMB to E 2 , 4-NP, and p,p'-DDE. 

DETAILED DESCRIPTION 
The invention is premised in part on the discovery of nucleic acids (e.g., those of 
SEQ ID NOs: 1-560) whose expression is modulated in response to estrogenic/androgenic 

20 agents in fish such as SHM and LMB. Several of these nucleic acids were not previously 
characterized. Thus, the invention includes these nucleic acids, variants of these nucleic 
acids, proteins encoded by these nucleic acids, antibodies against these proteins, as well 
as other embodiments that can be made by one of skill in the art having knowledge of 
these sequences. An important application of the discovery is an assay for detecting 

25 modulation of expression of these nucleic acids in order to analyze an environmental 
sample or uncharacterized sample molecule. Detection of such modulation in a 
biological sample indicates that the sample or molecule exerts a hormonal activity (e.g., 
estrogenic or androgenic activity) or an anti-hormonal activity (e.g., antiestrogenic, anti- 
androgenic activity). 

30 The below described preferred embodiments illustrate adaptations of these 

compositions and methods. Nonetheless, from the description of these embodiments, 
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other aspects of the invention can be made and/or practiced based on the description 
provided below. 

Biological Methods 
Methods involving conventional molecular biology techniques are described 

5 herein. Such techniques are generally known in the art and are described in detail in 

methodology treatises such as Molecular Cloning: A Laboratory Manual, 3 rd ed., vol. 1-3, 
ed. Sambrook et al., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
2001; and Current Protocols in Molecular Biology, ed. Ausubel et al., Greene Publishing 
and Wiley-Interscience, New York, 1992 (with periodic updates). Various techniques 

10 using PCR are described, e.g., in Innis et al., PCR Protocols: A Guide to Methods and 
Applications, Academic Press: San Diego, 1990. PCR-primer pairs can be derived from 
known sequences by known techniques such as using computer programs intended for 
that purpose (e.g., Primer, Version 0.5, ©1991, Whitehead Institute for Biomedical 
Research, Cambridge, MA). Methods for chemical synthesis of nucleic acids are 

15 discussed, for example, in Beaucage and Carruthers, Tetra. Letts. 22:1859-1862, 1981, 
and Matteucci et al., J. Am. Chem. Soc. 103:3185, 1981. Chemical synthesis of nucleic 
acids can be performed, for example, on commercial automated oligonucleotide 
synthesizers. 

Novel Fish Genes 

20 As several new genes were identified and characterized in making the invention, 

the invention provides several purified nucleic acids from SHM and LMB that are 
modulated in response to androgenic/estrogenic compounds. SHM nucleic acids of the 
invention have the nucleotide sequences of SEQ ID NOs: 151-419, while LMB nucleic 
acids of the invention have the nucleotide sequences of SEQ ID NOs: 1-150, 420-560. 

25 Various assays described herein include a step of analyzing expression of a SHM 

or LMB gene modulated in response to an estrogenic or androgenic agent. Thus, 
polynucleotides that preferentially bind to nucleic acids encoded by the gene (e.g., 
mRNA, cDNA, DNA complements of cDNA, etc.) are also within the invention. Such 
polynucleotides can have the exact sequence of all or a portion of SEQ ID NOs: 1-560 or 

30 the complements of SEQ ID NOs: 1-560. Because hybridization of two nucleic acids 
does not generally require 100% complementarity, variants of such polynucleotides are 
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also within the invention. These might include naturally occurring allelic variants of 
native LMB or SHM nucleic acids or non-naturally occurring variants that show 
sequence similarity to all or portions of SEQ ID NOs: 1-560 or the complements of SEQ 
IDNOs: 1-560 

Naturally occurring allelic variants of native LMB or SHM nucleic acids within 
the invention are nucleic acids isolated from LMB and SHM that have at least 75% (e.g., 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%) sequence identity with native 
LMB and SHM nucleic acids, and encode polypeptides having at least one functional 
activity in common with LMB and SHM polypeptides. Homologs of native LMB and 
SHM nucleic acids within the invention are nucleic acids isolated from other species 
(e.g., other fish species) that have at least 75% (e.g., 76%, 77%, 78%, 79%, 80%, 81%, 
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, and 99%) sequence identity with native LMB and SHM nucleic acids, and 
encode polypeptides having at least one functional activity in common with native LMB 
and SHM polypeptides. Naturally occurring allelic variants of LMB and SHM nucleic 
acids and homologs of LMB and SHM nucleic acids can be isolated by using a library 
screen, other assays described herein, or other techniques known in the art. The 
nucleotide sequence of such homologs and allelic variants can be determined by 
conventional DNA sequencing methods. Alternatively, public or non-proprietary nucleic 
acid databases can be searched to identify other nucleic acid molecules (e.g., nucleic 
acids from other species) having a high percent (e.g., 70, 80, 90% or more) sequence 
identity to native LMB and SHM nucleic acids. 

Non-naturally occurring LMB and SHM nucleic acids variants are nucleic acids 
that do not occur in nature (e.g., are made by the hand of man), have at least 75% (e.g., 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%) sequence identity with native 
LMB and SHM nucleic acids, and encode polypeptides having at least one functional 
activity in common with native LMB and SHM polypeptides. Examples of non-naturally 
occurring LMB and SHM nucleic acids are those that encode a fragment of an LMB or 
SHM protein, those that hybridize to native LMB and SHM nucleic acids or a 



{WP104247;2} 



10 



complement of native LMB and SHM nucleic acids under stringent conditions, those that 
share at least 65% sequence identity with native LMB and SHM nucleic acids or a 
complement of native LMB and SHM nucleic acids, and those that encode an LMB or 
SHM fusion protein. 

Nucleic acids encoding fragments of LMB and SHM polypeptides within the 
invention are those that encode, e.g., 2, 5, 10, 25, 50, 100, 150, 200, 250, 300, or more 
amino acid residues of LMB or SHM polypeptides. Shorter oligonucleotides (e.g., those 
of 6, 12, 20, 30, 50, 100, 125, 150 or 200 base pairs in length) that encode or hybridize 
with nucleic acids that encode fragments of LMB or SHM polypeptides can be used as 
probes, primers, or antisense molecules. Longer polynucleotides (e.g., those of 300, 400, 
500, 600, 700, 800, 900, 1000, 1 100, 1200 or 1300 base pairs) that encode or hybridize 
with nucleic acids that encode fragments of LMB or SHM polypeptides can be used in 
place of native LMB or SHM polynucleotides in applications where it is desired to 
modulate a functional activity of native LMB or SHM polypeptides. Nucleic acids 
encoding fragments of LMB or SHM polypeptides can be made by enzymatic digestion 
(e.g., using a restriction enzyme) or chemical degradation of full length LMB or SHM 
nucleic acids or variants of LMB or SHM nucleic acids. 

Nucleic acids that hybridize under stringent conditions to the nucleic acid of SEQ 
ID NOs: 1-560 or the complement of SEQ ID NOs: 1-560 are also within the invention. 
For example, nucleic acids that hybridize to SEQ ID NOs: 1-560 or the complement of 
SEQ ID NOs: 1-560 under low stringency conditions, moderate stringency conditions, or 
high stringency conditions are within the invention. Preferred such nucleic acids are 
those having a nucleotide sequence that is the complement of all or a portion of SEQ ID 
NOs: 1-560. Other variants of LMB or SHM nucleic acids within the invention are 
polynucleotides that share at least 65% (e.g., 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 
97, 98, and 99%) sequence identity to SEQ ID NOs: 1-560 or the complement of SEQ ID 
NOs: 1-560. Nucleic acids that hybridize under stringent conditions to or share at least 
65% sequence identity with SEQ ID NOs: 1-560 or the complement of SEQ ID NOs: 1- 
560 can be obtained by techniques known in the art such as by making mutations in 
native LMB or SHM nucleic acids, by isolation from an organism expressing such a 
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nucleic acid (e.g., a fish expressing a variant of native LMB or SHM nucleic acids), or an 
organism other than a fish expressing a homolog of native LMB or SHM nucleic acids. 

Nucleic acid molecules of the present invention may be in the form of RNA or in 
the form of DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The DNA may be 
double-stranded (ds) or single-stranded (ss), and if single-stranded may be the coding 
(sense) strand or non-coding (anti-sense) strand. The nucleic acid molecules of the 
present invention may also be polynucleotide analogues such as peptide nucleic acids 
(PNA). See, e.g. Gambari R., Curr. Pharm. Des. 7:1839-1862, 2001; U.S. patent number. 
6,395,474; and PCT patent application publication number WO 86/05518. The 
sequences which encode native LMB and SHM gene products may be identical to the 
nucleotide sequences shown in SEQ ID NOs: 1-560. They may also be different 
sequences which, as a result of the redundancy or degeneracy of the genetic code, encode 
the same polypeptides as the polynucleotides of SEQ ID NOs: 1-560. Other nucleic acid 
molecules within the invention are variants of nucleic acids of SEQ ID NOs: 1-560 such 
as those that encode fragments, analogs and derivatives of native proteins encoded by 
nucleic acids of SEQ ID NOs: 1-560. Such variants may be, e.g., a naturally occurring 
allelic variant of native nucleic acids of SEQ ID NOs: 1-560, a homolog of native nucleic 
acids of SEQ ID NOs: 1-560, or a non-naturally occurring variant of native nucleic acids 
of SEQ ID NOs: 1-560. These variants have a nucleotide sequence that differs from 
native nucleic acids of SEQ ID NOs: 1-560 in one or more bases. For example, the 
nucleotide sequence of such variants can feature a deletion, addition, or substitution of 
one or more nucleotides of native nucleic acids of SEQ ID NOs: 1-560. Nucleic acid 
insertions are preferably of about 1 to 10 contiguous nucleotides, and deletions are 
preferably of about 1 to 30 contiguous nucleotides. 

Probes and Primers 
Nucleic acids that hybridize under stringent conditions to the nucleic acid 
sequences of SEQ ID NOs: 1-560 or the complement of the nucleic acid sequences of 
SEQ ID NOs: 1-560 can be used in the invention. For example, such nucleic acids can be 
those that hybridize to the nucleic acid sequences of SEQ ID NOs: 1-560 or the 
complement of the nucleic acid sequences of SEQ ID NOs: 1-560 under low stringency 
conditions, moderate stringency conditions, or high stringency conditions. Preferred such 
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nucleic acids are those having a nucleotide sequence that is the complement of all or a 
portion of a nucleic acid sequence of SEQ ID NOs: 1-560. Others that might be used 
include polynucleotides that share at least 65% (e.g., 65, 70, 75, 80, 85, 90, 91, 92, 93, 
94, 95, 96, 97, 98, and 99%) sequence identity to a native nucleic acid sequence of SEQ 
ID NOs: 1-560 or the complement of a native nucleic acid sequence of SEQ ID NOs: 1- 
560. Nucleic acids that hybridize under stringent conditions to or share at least 65% 
sequence identity with the nucleic acid sequences of SEQ ID NOs: 1-560 or the 
complement of the nucleic acid sequences of SEQ ID NOs: 1-560 can be obtained by 
techniques known in the art such as by making mutations in a native nucleic acid 
sequence of SEQ ID NOs: 1-560, or by isolation from an organism expressing such a 
nucleic acid (e.g., an allelic variant). 

Methods of the invention utilize oligonucleotide probes (i.e., isolated nucleic acid 
molecules conjugated with a detectable label or reporter molecule, e.g., a radioactive 
isotope, ligand, chemiluminescent agent, or enzyme); and oligonucleotide primers (i.e., 
isolated nucleic acid molecules that can be annealed to a complementary target DNA 
strand by nucleic acid hybridization to form a hybrid between the primer and the target 
DNA strand, then extended along the target DNA strand by a polymerase, e.g., a DNA 
polymerase). Primer pairs can be used for amplification of a nucleic acid sequence, e.g., 
by the PCR or other conventional nucleic-acid amplification methods. 

PCR primers can be used to amplify the nucleic acid sequences of SEQ ID NOs: 
1-560 using known PCR and RT-PCR protocols. Such primers can be designed 
according to known methods as PCR primer design is generally known in the art. See, 
e.g., methodology treatises such as Basic Methods in Molecular Biology, 2nd ed., ed. 
Davis et al., Appleton & Lange, Norwalk, CN, 1994; and Molecular Cloning: A 
Laboratory Manual, 3rd ed., vol.1-3, ed. Sambrook et al., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 2001. 

Probes and primers utilized in methods of the invention are generally 15 
nucleotides or more in length, preferably 20 nucleotides or more, more preferably 25 
nucleotides, and most preferably 30 nucleotides or more. Preferred probes and primers 
are those that hybridize to a native nucleic acid sequence of SEQ ID NOs: 1-560 (or 
cDNA or mRNA) sequence under high stringency conditions, and those that hybridize to 
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homologs of the nucleic acid sequences of SEQ ID NOs: 1-560 under at least moderately 
stringent conditions. Preferably, probes and primers according to the present invention 
have complete sequence identity with a native nucleic acid sequence of SEQ ID NOs: 1- 
560. However, probes differing from this sequence that retain the ability to hybridize to a 
5 native nucleic acid sequence of SEQ ID NOs: 1-560 under stringent conditions may be 
designed by conventional methods and used in the invention. Primers and probes based 
on the nucleic acid sequences of SEQ ID NOs: 1-560 disclosed herein can be used to 
confirm (and, if necessary, to correct) the disclosed nucleic acid sequences of SEQ ID 
NOs: 1-560 by conventional methods, e.g., by re-cloning and sequencing a native nucleic 
10 acid sequence of SEQ ID NOs: 1-560 or cDNA corresponding to a native nucleic acid 
sequence of SEQ ID NOs: 1-560. 

Proteins Encoded By Nucleic Acid Sequences Of SEQ ID NOs: 1-560 
The invention also provides polypeptides encoded in whole or in part by the 
nucleic acid sequences of SEQ ID NOs: 1-560. Some polypeptides encoded by the 
1 5 nucleic acids of SEQ ID NOs: 1 -560 are expressed at higher levels when the nucleic 

acids are exposed to hormonal compounds compared to control nucleic acids not exposed 
to the hormonal compound. Other polypeptides encoded in whole or in part by the 
nucleic acid sequences of SEQ ID NOs: 1-560 are expressed at lower levels when 
exposed to hormonal compounds compared to the expression of nucleic acids not 
20 exposed to the hormonal compound. 

Variants of native proteins encoded in whole or in part by nucleic acid sequences 
of SEQ ED NOs: 1-560 such as fragments, analogs and derivatives of native proteins 
encoded by nucleic acid sequences of SEQ ID NOs: 1-560 may also be used in methods 
of the invention. Such variants include, e.g., a polypeptide encoded in whole or in part 
25 by a naturally occurring allelic variant of a native nucleic acid sequence of SEQ ID NOs: 
1-560, a polypeptide encoded by an alternative splice form of a native nucleic acid 
sequence of SEQ ID NOs: 1-560, a polypeptide encoded in whole or in part by a 
homolog of a native nucleic acid sequence of SEQ ID NOs: 1-560, and a polypeptide 
encoded in whole or in part by a non-naturally occurring variant of a native nucleic acid 
30 sequence of SEQ ID NOs: 1-560. 
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Protein variants encoded by a sequence having homology to a nucleic acid 
sequence of SEQ ID NOs: 1-560 have a peptide sequence that differs from a native 
protein encoded in whole or in part by a nucleic acid sequence of SEQ ID NOs: 1-560 in 
one or more amino acids. The peptide sequence of such variants can feature a deletion, 
addition, or substitution of one or more amino acids of a native polypeptide encoded in 
whole or in part by a nucleic acid sequence of SEQ ID NOs: 1-560. Amino acid 
insertions are preferably of about 1 to 4 contiguous amino acids, and deletions are 
preferably of about 1 to 10 contiguous amino acids. In some applications, variant 
proteins substantially maintain a native nucleic acid sequence of SEQ ID NOs: 1-560- 
encoded protein functional activity. For other applications, variant proteins lack or 
feature a significant reduction in a nucleic acid sequence of SEQ ID NOs: 1-560-encoded 
protein functional activity. Where it is desired to retain a functional activity of a native 
protein encoded in whole or in part by a nucleic acid sequence of SEQ ID NOs: 1-560, 
preferred protein variants can be made by expressing nucleic acid molecules within the 
invention that feature silent or conservative changes. Variant proteins with substantial 
changes in functional activity can be made by expressing nucleic acid molecules within 
the invention that feature less than conservative changes. 

Nucleic acid sequences of SEQ ID NOs: 1-560-encoded protein fragments 
corresponding to one or more particular motifs and/or domains or to arbitrary sizes, for 
example, at least 5, 10, 25, 50, 75, 100, 125, 150, 175, 200, and 250 amino acids in length 
may be utilized in methods of the present invention. Isolated peptidyl portions of 
proteins encoded by a nucleic acid sequence of SEQ ID NOs: 1-560 can be obtained by 
screening peptides recombinantly produced from the corresponding fragment of the 
nucleic acid encoding such peptides. In addition, fragments can be chemically 
synthesized using techniques known in the art such as conventional Merrifield solid 
phase f-Moc or t-Boc chemistry. For example, a protein encoded by a nucleic acid 
sequence of SEQ ID NOs: 1-560 used in methods of the present invention may be 
arbitrarily divided into fragments of desired length with no overlap of the fragments, or 
preferably divided into overlapping fragments of a desired length. The fragments can be 
produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl 
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fragments which can function as either agonists or antagonists of a native nucleic acid 
sequence of SEQ ID NOs: 1-560-encoded protein. 

Methods of the invention may also involve recombinant forms of the nucleic acid 
sequences of SEQ ID NOs: 1-560-encoded proteins. Recombinant polypeptides preferred 
by the present invention, in addition to native nucleic acid sequence of SEQ ID NOs: 1- 
560-encoded protein, are encoded by a nucleic acid that has at least 85% sequence 
identity (e.g., 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100%) with a 
native nucleic acid sequence of SEQ ID NOs: 1-560. In a preferred embodiment, variant 
proteins lack one or more functional activities of native nucleic acid sequence of SEQ ID 
NOs: 1-560-encoded protein. 

Protein variants can be generated through various techniques known in the art. 
For example, protein variants can be made by mutagenesis, such as by introducing 
discrete point mutation(s), or by truncation. Mutation can give rise to a protein variant 
having substantially the same, or merely a subset of the functional activity of a native 
protein encoded in whole or in part by a nucleic acid sequence of SEQ ID NOs: 1-560. 
Alternatively, antagonistic forms of the protein can be generated which are able to inhibit 
the function of the naturally occurring form of the protein, such as by competitively 
binding to another molecule that interacts with a protein encoded in whole or in part by a 
nucleic acid sequence of SEQ ID NOs: 1-560. In addition, agonistic forms of the protein 
may be generated that constitutively express one or more nucleic acid sequence of SEQ 
ID NOs: 1-560-encoded protein functional activities. Other protein variants that can be 
generated include those that are resistant to proteolytic cleavage, as for example, due to 
mutations that alter protease target sequences. Whether a change in the amino acid 
sequence of a peptide results in a protein variant having one or more functional activities 
of a native nucleic acid sequence of SEQ ID NOs: 1-560-encoded protein can be readily 
determined by testing the variant for a native nucleic acid sequence of SEQ ID NOs: 1- 
560-encoded protein functional activity. 

Antibodies 

Antibodies that specifically bind nucleic acid sequence of SEQ ID NOs: 1-560- 
encoded proteins can be used in methods of the invention, for example, in the detection 
of nucleic acid sequence of SEQ ID NOs: 1-560-encoded protein expression. Antibodies 
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of the invention include polyclonal antibodies and, in addition, monoclonal antibodies, 
single chain antibodies, Fab fragments, F(ab')2 fragments, and molecules produced using 
a Fab expression library. Antibodies can be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. 

Antibodies that specifically recognize and bind to nucleic acid sequence of SEQ 
EDNOs: 1-560-encoded proteins are useful in methods of the present invention. For 
example, such antibodies can be used in an immunoassay to monitor the level of the 
corresponding protein produced by a cell or an animal (e.g., to determine the amount or 
subcellular location of a nucleic acid sequence of SEQ ID NOs: 1-560-encoded protein). 
Methods of the invention may also utilize antibodies, for example, in the detection of a 
nucleic acid sequence of SEQ ID NOs: 1-560-encoded protein in an environmental 
sample. Antibodies also can be used in a screening assay to measure the effect of a 
candidate agent on expression or localization of a nucleic acid sequence of SEQ ID NOs: 
1-560-encoded protein. 

Detecting the Presence of an Agent Having Androgenic/Estrogenic Activity 
Within the invention, SEQ ED NOs: 1-560 are used in various methods for 
detecting the presence of estrogenic/androgenic agents (e.g., EDCs) such as E 2 , EE 2 , 
DES, MXC, ES, 4-NP, p-chlorophenyl, and p,p'-DDE in a sample. Examples of other 
EDCs that may be detected using compositions and methods of the invention include 
benzenehexachloride, 1,2-dibromoethane, chloroform, dioxins, furans, octachlorostyrene, 
PBBs, PCBs, PCB, hydroxylated PBDEs, and pentachlorophenol as well as others 
disclosed in Hormonally Active Agents In The Environment, Ed. by The Committee On 
Hormonally Active Agents In The Environment Board On Environmental Studies and 
Toxicology Commission On Life Sciences And National Research Council, National 
Academy Press, Washington D.C., 1999. 

Methods for detecting the presence of an agent having estrogenic or androgenic 
activity in a sample involve a first step of providing at least one fish cell which was 
exposed to the sample. A fish cell of the invention can be a cell from any fish, preferably 
a cell from a SUM or LMB. A sample can be obtained from a number of sources, 
including a body of water (e.g., river, lake, stream, canal, estuary, pond, etc.) as well as 
sediment obtained from a body of water or from a site near or contacting a body of water 
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(e.g., sediment from a lake or river bed). The fish cell exposed to the sample can be a 
cell taken from a fish that was present in a body of water (or in contact with sediment) 
from which the sample (i.e., environmental sample) was taken. The fish cell can also be 
a cell isolated from a provided fish that was contacted with the sample (e.g., taken from a 
5 fish that was exposed to a sample in controlled, laboratory conditions). Alternatively, the 
fish cell can be one that was cultured and exposed to the sample in vitro. 

A second step of this method involves analyzing the at least one fish cell for 
expression of at least one gene encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-560. A number of methods for analyzing gene expression 
10 are described below. A third step of this method involves comparing the expression of 
the at least one gene in the cell compared to the expression of the at least one gene in a 
control cell not exposed to the sample or an agent having estrogenic or androgenic 
activity, wherein a difference in the expression of the at least one gene in the at least one 
fish cell compared to the expression of the same at least one nucleic acid in the control 
15 cell indicates that the sample contains an agent having estrogenic or androgenic activity. 

The step of analyzing the at least one fish cell can include analyzing the cell for 
expression of at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 100) different genes, 
each being wholly or partially encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-560. To analyze the cell for expression of at least one 
20 gene, RNA transcripts can be isolated from the at least one cell. The isolated RNA 
transcripts or nucleic acids derived therefrom can be used as templates and contacted 
with at least one probe that hybridizes under stringent conditions to at least one 
nucleotidesequenceselectedfromthegroupconsistingofSEQIDNOs: 1-560. Thisstep 
can also include contacting the RNA transcripts or nucleic acids derived therefrom with 
25 at least two (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 75, 100, 150) different probes 
that each hybridize under stringent conditions to a different nucleotide sequence selected 
from the group consisting of SEQ ID NOs: 1-560. The at least one probe (or probes) or 
the isolated RNA transcripts (or nucleic acids derived therefrom) can be conjugated with 
a detectable label such as a fluorphore or a radioactive molecule or compound. The 
30 probe(s) can be immobilized on a substrate (e.g., array) before placed in contact with 
RNA transcripts isolated from a fish cell or control cell, or can be contacted with the 
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RNA transcripts in solution (e.g., real-time PCR assay) rather than in the presence of a 
substrate. Examples of substrates that may be used include nylon, nitrocellulose, glass, 
and plastic. 

In another method of detecting the presence of an agent having estrogenic or 
androgenic activity in a sample, the control cell not exposed to the sample or an agent 
having estrogenic or androgenic activity is analyzed for expression of at least one gene 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1- 
560. For example, RNA transcripts can be isolated from the control cell and contacted 
with the RNA transcripts or nucleic acids derived therefrom using the isolated RNA 
transcripts as templates with at least one probe that hybridizes under stringent conditions 
to at least one nucleotide sequence selected from the group consisting of SEQ ID NOs: 1- 
560. This method can further include isolating RNA transcripts from the at least one fish 
cell and contacting the RNA transcripts isolated from the at least one fish cell (or nucleic 
acids derived therefrom) with at least one molecule that hybridizes under stringent 
conditions to at least one nucleotide sequence selected from the group consisting of SEQ 
ID NOs: 1-560. In some applications, the at least one probe is conjugated with a first 
detectable label and the at least one molecule is conjugated with a second detectable label 
differing in chemical structure from the first detectable label. In other applications, the 
RNA transcripts (or nucleic acids derived therefrom) isolated from the at least one fish 
cell are conjugated with a first detectable label and the RNA transcripts isolated from the 
control cell are conjugated with a second detectable label differing in chemical structure 
from the first detectable label. 

To compare expression of the at least one nucleic acid in the fish cell compared 
to the expression of the at least one nucleic acid in a control cell not exposed to the 
sample or an agent having estrogenic or androgenic activity, both 1) the amount of first 
detectable label associated with the RNA transcripts isolated from the control cell (or 
nucleic acids derived therefrom) and 2) the amount of second detectable label associated 
with the RNA transcripts isolated from the at least one fish cell (or nucleic acids derived 

therefrom) is quantified. 

In one example of comparing expression of the at least one nucleic acid in the fish 
cell to the expression of the at least one nucleic acid in the control cell, the labeled RNA 
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transcripts (or nucleic acids derived therefrom) isolated from the at least one fish cell and 
from the control cell are contacted e.g., on an array as described herein. Hybridization of 
the differentially labeled transcripts to the nucleic acids is then detected (e.g., using an 
imaging device such as a phosphor screen or autoradiographic film) and signal intensities 
are quantitatively analyzed (e.g., using a software program such as Atlaslmage™ 2.01 

Clontech, Palo Alto, CA). 

Among the traditional methods that can be employed for gene expression analyses 
are DD RT-PCR, nucleic acid arrays, quantitative PCR (e.g., real-time PCR), in situ 
hybridization, serial analysis of gene expression (SAGE), and subtractive hybridization. 
DD RT-PCR, for example, isolates differentially expressed genes using both arbitrary 
and anchored oligo-dT primers (Liang & Pardee, 1992; Liang et al., 1994; and Genome 
Analysis: A Laboratory Manual Series 1, ed: B. Birren et al., 1997, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY). A typical DD RT-PCR protocol involves 
several steps including reverse transcription using anchored oligo-dT primers, 
amplification of cDNA using one anchored and one arbitrary primer, electrophoresis of 
PCR products, purification of the product of interest, and cloning and sequencing of the 
product. In one method, DD-RT-PCR is performed with the RNAimage mRNA 
Differential Display system (GenHunter; Nashville, TN) using one-base anchored oligo- 
dT primers (Liang et al., 1994) as described previously (Denslow et al., 1999a; and 

Denslow et al., 2001). 

In vitro quantitation of gene expression can be performed using a number of real- 
time quantitative PCR assays. Real-time quantitative PCR assays typically involve 
labeling a target nucleic acid with a first fluorescing dye and labeling a probe with a 
second fluorescing dye. For example, Multiplex TaqMan® (Applied Biosystems, Foster 
City, CA) assays can be performed using the ABI PRISM® 7700 Sequence Detection 
System (Applied Biosystems, Foster City, CA), capable of detecting multiple dyes with 
distinct emission wavelengths. Some real-time quantitative PCR applications involve the 
use of fluorescence resonance energy transfer (FRET) between fluorochromes introduced 
into DNA molecules (e.g., molecular beacon assays). For a review of FRET techniques, 
see Vet et al., Expert Rev. Mol. Diagn. 2:77-86, 2002. 
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A preferred technique for detecting the presence of estrogenic compounds 
involves the use of nucleic acid arrays. Nucleic acid arrays allow the simultaneous 
monitoring of expression patterns of multiple genes from the same sample. Arrays are an 
appropriate tool for rapidly screening large numbers of genes. Examples of nucleic acid 
arrays include microarrays and macroarrays. Methods involving nucleic acid arrays are 
reviewed in Ringner et al., Pharmacogenomics 3:403-415, 2002; Epstein et al., Curr. 
Opin. Biotechnol. 11:36-41, 2000; Granjeaud et al., BioEssays 21:781-790, 1999; 
Hatakeyama K., Nippon Rinsho 57:465-473, 1999; DNA Microarrays: A Molecular 
Cloning Manual, ed: D. Bowtell and J. Sambrook, 2002, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, and U.S Patent No. 6,410,229. The construction and use 
of nucleic acid arrays containing fish genes is described below. 

Arrays 

The nucleic acids (and proteins and antibodies) of the invention are preferably 
useful for assaying a sample for the presence of a hormonal agent (e.g., an estrogenic, 
sample in an environmental water sample). In this regard, nucleic acid-based assays are 
presently preferred. The invention thus provides a substrate having immobilized thereon 
at least one nucleic acid including a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-560 and complements thereof. A typical substrate having 
immobilized thereon at least one nucleic acid of the invention is an array of fish nucleic 
acids, including nucleic acids (e.g., genes and gene fragments) responsive to androgenic 
and estrogenic compounds. Arrays containing fish-derived nucleic acids responsive to 
androgenic and/or estrogenic compounds can be used in a number of applications. For 
example, the arrays can be used to monitor the presence and distribution of androgenic 
and estrogenic contaminants in the environment. The arrays can also be used to screen 
for synthetic or natural agents having androgenic or estrogenic activity. An example of 
an array provided by the invention is a macroarray containing LMB- or SHM-derived 
nucleic acids. On a preferred macroarray of the invention, a minimum number of 
nucleotides of 150 is included for each nucleic acid (e.g., 2, 10, 50, 75, 100, 125, 130, 
135, 140, 145, 150, 155, 160, 165, 170, 200, 250, 300, 350, 400 or more). A portion of 
the nucleic acids on the macroarray are responsive to estrogenic compounds. A list of 
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nucleic acids that may be contained within a macroarray of the invention is presented in 
Table III (SHM), Table II (LMB), and Table IV (SHM and LMB). 

To construct a cDNA macroarray, cDNA is first prepared from RNA. Techniques 
for preparing cDNA from RNA are widely known, and are described in methodology 
treatises such as Sambrook and Russell supra and Ausubel et al., supra. In one example, 
cDNA clones (e.g., miniprep cDNAs) derived from DD RT-PCR analysis (as described 
above) are PCR-amplified using primers specific to the cloning vector (e.g., pGEMT- 
Easy, Promega, Madison, WI). Any suitable thermocycling conditions that result in 
amplification of the desired product may be used. After completion of the PCR, the 
products are purified (e.g., in a spin-column, Quiagen, Chatsworth, CA) and then 
concentrated (e.g., in a speed-vac). Aliquots of the PCR products are then resolved 
electrophoretically (e.g., run on a 1.2% agarose gel containing 0.3 mM ethidium 
bromide). The resultant gels are analyzed (e.g., digitally imaged using a UVP Bio Doc-It 
camera, Ultra Violet Laboratory Products, Upland CA) and the concentration of each 
PCR product is determined. Typically, concentrations of PCR products are determined 
by comparing the intensity of each band to a standard curve derived from a low DNA 
mass ladder (InVitrogen Corporation, Carlsbad, CA). 

Once the PCR products are purified and their concentrations determined, they are 
then spotted onto a membrane (e.g., nylon membrane). Methods for spotting cDNAs 
onto membranes are discussed in Diehl et al., NAR 29.E38, 2001; Shieh et al., 
Biotechniques 32:1360-1362 & 1364-1365, 2002; and Schuchhardt et al, NAR28:E47, 
2000. In one method of spotting the cDNAs onto a membrane, PCR products are 
denatured, quenched on ice, and robotically spotted onto nylon membranes (Fisher 
Scientific). In this method, membranes are cross-linked and stored under vacuum at 
room temperature until the hybridization step. Various controls are also spotted onto the 
membranes. These controls provide information about cDNA labeling efficiency, 
blocking at the pre-hybridization step, and non-specific binding. Any genes that are not 
responsive to estrogen may be used as negative control genes on an array of the 
invention. Control genes that are not responsive to estrogen include Arabidopsis thaliana 
cDNA clones, Cot-1 repetitive sequences, polyA sequence (SpotReport 3, Stratagene, 
LaJolla, CA), and a M13 sequence (vector but no cDNA insert). The consistency of the 
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spotting technique may be assessed by spotting on the array multiple cDNA products 
from the same gene that were amplified in separate PCR reactions. 

For the generation of probes, mRNA from fish exposed to an estrogenic 
compound (e.g., E 2 , EE 2 , DES, pNP, ES, MXC) and mRNA from control fish (i.e., fish 
not exposed to estrogenic compounds), is extracted and purified. mRNA may be purified 
by a number of known techniques, including the use of affinity columns (Qiagen, 
Chatsworth, CA). In addition to RNA probes, cDNA probes may also be used. The 
labeling of nucleotide probes is described in Relogio et al., NAR 30:351, 2002; and Yu et 
al., Mol. Vis. 8:130-137, 2002. Probes maybe labeled using any of a number of 
techniques, including fluorescence (e.g., Atlas Glass Fluorescent Labeling Kit, Clontech, 
Palo Alto, CA), resonance light scattering (Bao et al., Anal. Chem. 74:1792-1797, 2002), 
gold nanoparticle labeling (Fritzsche et al., J. Biotechnol. 1 :37-46, 2001) and radioactive 
methods. In one example of radiolabeling RNA probes, DNase-treated total RNA from 
fish is subjected to random primer labeling with a- 33 P dATP (Strip-EZ RT, Ambion, 
Austin, TX). RNAs may also be radiolabeled using a kit such as AtlasPure™ RNA 
Labeling System. Typically, blots are prehybridized for several hours, hybridized 
overnight with probe-containing solution, and then washed several times. 

To detect hybridization of the probe to nucleotides on an array, membranes are 
exposed to a suitable imaging device, such as a phosphor screen (Molecular Dynamics, 
Piscataway, NJ) or autoradiographic film for an appropriate period of time (e.g.,several 
hours). Signal intensities maybe quantitatively analyzed using a suitable software 
program, such as Atlaslmage™ 2.01 (Clontech, Palo Alto, CA). Blots may also be 
quantitatively evaluated using a Typhoon 8600 imaging system (Molecular Dynamics). 
For each nucleotide (e.g., cDNA) clone on an array, the general background of each 
membrane is subtracted from the average value of the duplicate spots on the membrane. 
The values are normalized to the average value of several (e.g., 1 1) nucleotide (e.g., 
cDNA) clones. Gene array data is analyzed using a suitable statistical analysis. For 
example, linear regression and one-way analysis of variance, with Tukey post-hoc 
analysis (SigmaStat and SigmaPlot, Jandel, CA) may be used to analyze the gene array 
data. 
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Determining Whether An Agent Has Estrogenic, Anti-Estrogenic, Androgenic, or Anti- 

Androgenic Activity 
In addition to detecting the presence of estrogenic compounds in the environment, 
nucleic acid arrays containing one or more nucleotide sequences of SEQ ID NOs: 1-560 
of the invention may also be used to screen for compounds with estrogenic, anti- 
estrogenic, androgenic, or anti-androgenic activity. Estrogenic compounds (e.g., 
estrogen, estrogen mimics) have possible uses in a number of disorders, including the 
treatment of cardiovascular disease, menopausal symptoms and menopausal osteoporosis. 
Molecules or compounds with anti-estrogenic activity (e.g., flavonoids) have a number of 
possible applications, including the treatment of breast cancer. Androgenic agents also 
have a number of applications, including the treatment of sexual dysfunction, depression 
and pelvic endometriosis. Androgenic agents are also fed to livestock as growth- 
inducing agents. For the treatment of prostate enlargement and acne, anti-androgenic 
agents are useful. 

A method for determining whether an agent has estrogenic, anti-estrogenic, 
androgenic, or anti-androgenic activity involves several steps. A first step in this method 
includes providing at least one fish cell. In a second step of the method, the at least one 
fish cell is contacted with the agent. In a third step, the at least one fish cell is analyzed 
for expression of at least one gene wholly or partially encoded by a nucleotide sequence 
selected from the group consisting of SEQ ID NOs: 1-560. A fourth step of the method 
includes comparing the expression of the at least one gene in the cell compared to the 
expression of the at least one nucleic acid in a control cell not exposed to the sample or 
an agent having estrogenic, anti-estrogenic, androgenic or anti-androgenic activity. In 
this method, a difference in the expression of the at least one nucleic acid in the at least 
one fish cell compared to the expression of the at least one nucleic acid in the control cell 
indicates that the agent has estrogenic, anti-estrogenic, androgenic, or anti-androgenic 
activity. 

In one embodiment of determining if a test agent increases or decreases 
expression of a gene responsive to estrogen, cells are first exposed to the test agent in 
vitro. For example, multiple compounds can be tested simultaneously by plating cells in a 
multi-well plate (e.g., in a 96 well tissue culture plate) and contacting one test compound 
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per well. RNA from the exposed cells as well as from control cells (i.e., negative control 
cells not exposed to the test compound and positive control cells exposed to the test 
compound) is isolated and reverse transcribed to cDNAs. The cDNAs are labeled to 
generate probes as described above, and contacted with the nucleic acid arrays of the 
invention. Hybridization of the labeled probes to the nucleic acids of the array (e.g., SEQ 
ID NOs: 1-560) is analyzed as described above. Alternatively, whole fish can be exposed 
to the test agents in the water or through the food. This allows for normal metabolic 
processes to occur within the various tissues of the fish to end up with an agent that has 
either the same or more or less activity then the parent agent. 

EXAMPLES 

The present invention is further illustrated by the following specific examples. 
The examples are provided for illustration only and are not to be construed as limiting the 
scope or content of the invention in any way. 

Example 1 - Expression Profiling of Estrogenic Compounds Using A SHM cDNA 

Macroarray 
Methods 

Amplification of cDNA to be spotted on macroarrays: Minipreps of 30 cDNA 
clones derived from DD RT-PCR analysis (Denslow et al., Gen. Comp. Endocrinol. 
121:250-260, 2001; Denslow et al., Comp. Biochem. Physiol. B. Biochem. Mol. Biol. 
129:277-282, 2001) were PCR amplified in a 300 reaction containing IX PCR Buffer 
A (Promega, Madison,WI), 2mM MgCl 2 (Promega, Madison, WI), 160 \M each 
deoxynucleotide triphosphate (dNTP) (Statagene, La Jolla, CA), 0.4 uM M13 primers 
(5'-GTT TTC CCA GTC ACG ACG TTG (SEQ ID NO:561) and 5'-GCG GAT AAC 
AAT TTC ACA CAG GA (SEQ ID NO:562), and 1.25 units Taq polymerase (Promega, 
Madison, WI). The PCR reaction conditions were: 1 cycle at 80°C (1 min); 1 cycle at 
94°C (2min); 32 cycles at 94°C (1 min) 57°C (1 min) 72°C (2 min); 1 cycle at 72°C (10 
min); and then hold at 4°C. After completion of the PCR reactions the products were 
purified in a spin-column (Qiagen, Chatsworth, CA) and then concentrated in a speed- 
vac. Aliquots of the PCR products were run on a 1.2% agarose gel containing 0.3 mM 
ethidium bromide. The gels were digitally imaged using a UVP Bio Doc-It camera (Ultra 
Violet Laboratory Products, Upland CA) and the concentration of each PCR product was 
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determined by comparing the intensity of each band to a standard curve derived from a 
low DNA mass ladder (Invitrogen Corporation, Carlsbad, CA). The PCR products were 
adjusted to a concentration of 160 ng/pL cDNA template. 

Spotting of the macroarrays: The PCR products were loaded into 96 well plates 
(Fisher Scientific, Pittsburgh, PA), denatured with 3 M NaOH, heated to 65°C for 10 
mins, and then immediately quenched on ice. 20 X saline sodium citrate (SSC) (3M 
NaCl, 0.3M sodium citrate, pH 7.0) containing 0.01 mM bromophenol blue was added to 
the samples to yield a final concentration of 0.3M NaOH, 6X SSC, and 100 ng/pL cDNA 
template. The PCR products were robotically spotted (Biomek 2000, Beckman Coulter, 
Fullerton, CA) in duplicate onto 1 1.5 by 7.6 cm neutral nylon membranes (Fisher 
Scientific) using 100 nL pins. Membranes were UV cross-linked at 1X10 5 pJoules (UV 
Stratalinker 1800, Stratagene, La Jolla, CA) and stored under vacuum at room 
temperature until hybridization. 

Array controls: Various controls were also spotted onto the membranes, which 
provided information about cDNA labeling efficiency, blocking at the pre-hybridization 
step, and non-specific binding. These controls included: 3 Arabidopsis thaliana cDNA 
clones, Cot-1 repetitive sequences, poly A sequence (SpotReport 3, Stratagene), and a 
M13 sequence (vector but no cDNA insert). The consistency of the technique was 
evaluated by spotting on the array multiple cDNA products from the same gene that were 
amplified in separate PCR reactions. 

Sample extraction: Total hepatic messenger ribonucleic acid (mRNA) was 
extracted using affinity columns (Qiagen, Chatsworth, CA) from adult male SHMs 
treated by aqueous exposure to either 65.14 ng/L of E 2 , 109 ng/L EE 2 , 100 ng/L DES, 
1 1 .81 ug/L pNP, 590.3 ng/L ES or 5.59 pg/L MXC as described previously (Folmar et 
al., Aquatic Toxicol. 49:77-88, 2000; Hemmer et al., Environ. Toxicol. Chem. 20:336- 
343, 2001). Three fish were used per treatment group. Criteria for selection of samples 
from each compound tested were based on previously generated dose response curves 
(Folmar et al., Aquat. Toxicol. 49:77-88, 2000; Hemmer et al., Environ. Toxicol. Chem. 
20:336-343, 2001) and chosen to give similar levels of expression of Vtg mRNA, a well 
established estrogenic biomarker (Bowman et al., Gen. Comp. Endocrinol. 120:300-313, 
2000; Sumpter and Jobling, Environ. Health Perspect. 103:173-178, 1995). By selecting 
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the concentration and length of exposure to yield similar Vtg mRNA expression levels, 
differing potencies among the chemicals tested was accounted for. Based on this 
criterion, length of exposure was four days for EE 2 and DES, five days for E 2 and pNP, 
and thirteen days for MXC. ES treatment levels ranging from 68.8 ng/L to 788.33 ng/L 

5 failed to induce Vtg mRNA. A treatment of 590.3 ng/L of ES for these analyses was 
chosen. This level of ES was slightly below the maximum acceptable toxicant 
concentration (MATC) derived for ES for SHMs (Hansen and Cripe 1991). 

Labeling of RNA and hybridization: Radiolabeled probes were generated by 
random primer labeling of DNase treated (DNA-/ree, Ambion, Austin,TX) total RNA 

10 from male SHM livers with [a- 33 P] dATP (Strip-EZ RT, Ambion, Austin, TX). The 
blots were prehybridized with ultraArray hybridization buffer (Ambion, Austin, TX) at 
64°C for 3 hours. Following prehybridization, each probe was diluted 20-fold with 10 
mM disodium ethylenediaminetetraacetate (EDTA), pH 8.0 to yield 1X10 6 cpm 
incorporated 33 P per mL hybridization solution. The diluted probes were heated to 95 °C 

15 for 5 mins, quenched on ice for 1 min, and added directly to the prehybridization buffer. 
The blots were then hybridized overnight at 64°C. Following hybridization, the blots 
were washed 4X15 minutes each with low (2X SSC, 0.5% SDS) and high (0.5X SSC 
and 0.5% SDS) stringency washes (Ambion, Austin, TX) at 64°C. 

Detection and normalization: The membranes were exposed to a phosphor screen 

20 (Molecular Dynamics, Piscataway, NJ) at room temperature for 48 hrs. The blots were 
quantitatively evaluated using a Typhoon 8600 imaging system (Molecular Dynamics, 
Piscataway, NJ). For each cDNA clone, the general background of each membrane was 
subtracted from the average value of the duplicate spots on the membrane. The values 
were normalized to the average value of 1 1 cDNA clones. These genes include 

25 ribosomal proteins L8, S9, two unique genes that are similar to ribosomal protein S9, and 
several clones that do not match any sequences in the National Center for Biotechnology 
Information (NCBI) database. These genes were chosen to normalize the data because 
they did not fluctuate appreciably (< 1.3 fold) on macro arrays from E 2 -treated and 
control fish and also were shown to be equally expressed in controls and treated fish by 

30 DD analysis data. Gene array data was analyzed using linear regression and one-way 
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analysis of variance, with Tukey post-hoc analysis (SigmaStat and SigmaPlot Jandel 
CA). 

Results 

As a first step toward using array technology, the variability between the 
5 macroarrays was determined. To accomplish this, aliquots of identical RNA samples 
were hybridized onto two separate membranes. A scatter plot correlating the intensity 
values for each spot on the two membranes was generated. The data points in the graph 
cluster along a slope of one for all of the spots, including both the low and highly 
expressed cDNA clones (R 2 = 0.94). Similar R 2 values ranging from 0.88-0.97 were 
1 0 observed in replicate experiments. 

cDNAs corresponding to thirty unique genes were spotted on the macroarrays. 
These genes were originally isolated by comparing gene expression profiles from control 
and Entreated fish by DD RT-PCR. Hepatic mRNA from exposed fish were radiolabeled 
and individually hybridized to membranes to determine if fish treated with E 2 , EE 2 , DES, 
pNP, MXC, and ES shared similar expression profiles. Three separate fish were used for 
each treatment. Figure 1 contains representative membranes from the different 
treatments and a graphical representation of the data is shown in Figure 2. Figure 2A 
illustrates the mean ±SEM intensity values for each of the cDNA clones arranged in order 
of their expression; Figure 2B illustrates the mean intensity values for each of the cDNA 
clones for E 2 , EE 2 , DES, pNP, MXC or ES divided by the mean intensity values of the 
respective cDNA clones from the untreated control fish. 

Several of the genes that were spotted on the array were found to be up or-down 
regulated in Entreated fish compared to controls. These genes were identified by 
comparing their intensity values to constitutive genes after correcting for intra-membrane 
differences based on the intensity values of 1 1 cDNA clones used to normalize the data. 
Genes on the macroarray were designated as constitutive if their fold-induction values 
fell within the range of the mean plus one standard deviation of the highest and lowest 
values of the 1 1 clones. Based on this criteria, any cDNA clones in the macroarray 
experiments above a ~1. 66-fold induction were designated as up-regulated genes 
respective to control fish, and any cDNA clones that had a value below -0.42 were 
designated as down-regulated. 
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Of the 30 genes used on the array, 6 genes were found to be up-regulated by E 2 
including Vtg a and p, choriogenin 2 and 3, ER a, and coagulation factor XI. Three 
genes found to be down-regulated by E 2 were transferrin, beta actin, and alpha-1- 
microglobulin/bikunin precursor protein. The remaining genes did not appear to be 
differentially regulated by E 2 when compared to controls. 

The 9 genes that were up or down-regulated by EE 2 , DES, pNP, and MXC 
exposures showed a similar pattern of expression to the E 2 treatment. Interestingly, 
ubiquitin-conjugating enzyme 9 was significantly (P<0.05) up-regulated only in the pNP 
treatments suggesting its regulation is not mediated through the ER. Eight of the nine 
genes that were found to be up or down-regulated for E 2 , EE 2 , DES, pNP, and MXC did 
not fluctuate for ES-treated fish, but instead resembled the pattern observed in control 
fish. The primary exception was ER a, which appeared to be up-regulated for all of the 
compounds, including ES. An additional gene, 3-hydroxy-3-methylglutaryl CoA 
reductase, appeared to be slightly down-regulated in fish treated with ES compared to all 
of the other treatments and the controls. 

To determine if the gene expression profiles on the array could be verified by 
other techniques that monitor mRNA expression, the expression profiles of several genes 
on the arrays were compared (Vtg a, choriogenin 2, and transferrin) to their profile by 
Northern blots and DD RT-PCR. Both Vtg a and choriogenin 2 mRNA levels increase 
in fish treated with E 2 , as measured by Northern blots and DD RT-PCR. Transferrin 
decreases with E 2 treatment, as measured by Northern blots and DD RT-PCR. 

To assess whether the arrays could be used as a quantitative tool to measure the 
expression of multiple genes at varying concentrations of an estrogenic chemical, male 
SHMs exposed for 4 days to either 24, 109, or 832 ng/L of EE 2 were examined (Folmar 
et al., Aquatic. Toxicol. 49:77-88, 2000; Hemmer et al., Environ. Toxicol. Chem. 20:336- 
343, 2001). Figure 3 contains graphical illustrations of genes whose expression levels 
significantly changed more than 2-fold in one or more of the three EE 2 concentrations 
examined (P<0.05). Vtg a and p, choriogenin 2, choriogenin 3, ER a, and clone ND107- 
B were found to increase in a concentration dependent manner in the EE 2 -exposed fish. 
Three other genes, transferrin, alpha- 1-microglobulin/bikunin precursor protein, and beta 
actin, appeared to decrease in a dose-dependent manner. These results were consistent 
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with the same genes that were up or down- regulated in the E 2 , DES, pNP, and MXC 

exposed fish (Figure 2). 

Example 2 - Expression Profiling of E 2 Using a SHM Array 

A SHM estrogen responsive macroarray was developed to investigate the 
feasibility of applying array technology in monitoring the environmental distribution of 
endocrine disrupting compounds that mimic estrogen. 

Total hepatic mRNA was extracted from 5 adult male SHMs treated by aqueous 
exposure to 100 ng/L of E 2 dissolved in methylene glycol (TEG) for 5 days. Minipreps 
of 54 cDNA clones derived from DD analysis were PCR amplified using primers specific 
to the M13 sequence of the cloning vector (pGEMT-Easy, Promega, Madison,WI). After 
the PCR reactions the products were purified in spin-columns (Qiagen, Chatsworth, CA) 
and then concentrated in a speed-vac. The cDNA samples were denatured with NaOH, 
heated to 65°C for 10 min, and then immediately quenched on ice. 20X SSC (3M NaCl, 
0.3M sodium citrate, pH 7.0) that contained 0.01 mM bromophenol blue was then added 
to the samples to yield a final concentration of 0.3M NaOH, 6X SSC, and 100 ng/uL 
cDNA template. The samples were then robotically spotted (Biomek 2000, Beckman 
Coulter, Fullerton, CA) in duplicate onto neutral nylon membranes (Fisher Scientific, 
Pittsburgh, PA) using 100 nL pins. The membranes were UV cross-linked and then 
stored under vacuum at room temperature until hybridized. Various controls, which 
provided information about the cDNA labeling efficiency, blocking, and non-specific 
binding of the arrays, were also spotted onto the membranes. These controls included: 3 
Arabidopsis thaliana cDNA clones, Cot-1 repetitive sequences, poly A sequence 
(SpotReport 3, Stratagene, La Jolla, CA), and a Ml 3 sequence (vector but no cDNA 
insert). Labeling of RNA probes and hybridization of blots was performed as described 
in Example 1 . 

The inter-membrane process variability between macroarrays was determined by 
hybridizing aliquots of identical RNA samples onto two separate membranes. A scatter 
plot correlating intensity values between the membranes was generated. The data points 
in the graph clustered along a slope of one (R2 of 0.95, Sigma Stat, Jandel, CA), a result 
which indicates that there is very little variability between membranes. 
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To determine if the gene transcripts found to be up- or down-regulated initially by 
DD analysis reflect the same induction pattern when spotted onto array membranes, RNA 
from adult male SHMs aqueously exposed to 100 ng/L of E 2 dissolved in TEG were 
radiolabeled and hybridized to several membranes. Figures 4A and 4B contain blots of 
control (TEG-treated) and E 2 -treated fish, respectively. Figure 4C is a plot of the mean 
intensity values. Genes on the macroarray were designated as constitutive genes if their 
intensity values fell within the range of the highest (1.27) and lowest (0.83) value of the 
17 cDNA clones that were used to normalize the data. Based on this criteria, any cDNA 
clone in the macroarray experiments that had an intensity value above ~1 .27 was 
designated an E 2 up-regulated gene, and any cDNA clone that had a value below -0.83 
was designated an E 2 down-regulated gene. Of the 54 cDNA clones that were spotted on 
the array, 1 5 genes appeared to be up-regulated by E 2 , 32 clones appeared to be 
constitutive, and 7 genes appeared to be down-regulated by exposure to E 2 . All of the 
highly up-regulated genes, including vitellogenin a and p" and the choriogenic protein 
(ZP2) were also shown to be up-regulated on DD analysis. Interestingly, transferrin, a 
protein involved in iron transport that was identified to be down-regulated by DD 
analysis also appears to be down-regulated in response to E 2 on the macroarrays. 
Example 3 - Gene Expression Profiles of LMB Exposed to 4-NP and ICI 182,780 Using a 

LMB Array 

Experimental Design and Sample Collection: Adult male LMB were purchased 
from American Sports Fish Hatchery (Montgomery, Alabama) and maintained in 
fiberglass tanks as previously described (Larkin et al., Comp. Biochem. Physiol. B. 
Biochem. Mol. Biol. 133:543-557, 2002; Larkin et al, Marine Environ. Res. 54:395-399, 
2002; and Larkin et al., Comparative Biochemistry and Physiology 133:543-557, 2002). 
Array technology as a tool to monitor exposure of fish to xenoestrogens. Marine Environ. 
Res., 2002; Bowman et al., Mol. Cell. Endocrinol. 196:67-77, 2002). Each fish was 
injected IP with either 50 mg/kg 4-NP (Fluka, St. Louis, MO # 74430), the combination 
of 50 mg/kg 4-NP and 1.0 mg/kg ICI 182,780 (Tocris Cookson), or vehicle, which 
consisted of ethanol and DMSO (Sigma, St. Louis, MO #5879). Each dose was dissolved 
in 1 ml of ethanol and then diluted to the appropriate concentration with DMSO. The 
fish were euthanized by submersion in a water bath containing 50-100 ppm MS-22 48 
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hours post injection and sacrificed by a sharp blow to the head followed by cervical 
transaction. The livers were excised and immediately flash frozen in liquid nitrogen. The 
frozen tissues were stored at -80°C until RNA was isolated. 

RNA Isolation: Isolation of total RNA from liver tissue was performed with the 
RNA Stat-60 reagent (Tel-test). Briefly, 30mg - 50mg of tissue stored in RNA later was 
homogenized in 0.9 mis STAT 60, choloroform was added, and the mixture was 
centrifuged at 12,000g for 15 minutes at 4°C. The extraction process was repeated and 
the pooled RNA was added to 500 ul isopropanol and allowed to precipitate at -20° C for 
at least one hour. Following centrifugation at 12,000g for 50 minutes, the pellet was 
washed with 70% ethanol, air dried and resuspended in an appropriate volume (50 ul-120 
Hi of RNA secure. The samples were treated with RNA secure (Ambion, Austin TX 
#7010) to inactivate contaminating RNases. All isolated RNA was treated with DNase 
solution (Ambion, Austin, TX #1906) following the manufacture's protocol. For all 
RNA samples, the quantity and quality of total RNA was assessed by spectrophotometric 
readings at 260nm and by electrophoresis through a 1% formaldahyde agarose gel stained 

with ethidium bromide. 

Real-Time PCR: Real time PCR was performed using reagents and a 5700 
thermocycler purchased from Applied Biosystems (ABI, Foster City, CA). The 
nucleotide sequences of the primers for the ER subtypes and Vtg 1 are as follows: 5' 
GACTACGCCTCCGGCTATCAYTATGG (SEQ ID NO:563) AND 
5'CATCAGGTAGATCTCAGGGGGYTCNGCNTC (SEQ ID NO:564). Probes and 
primers for the ER subtypes and Vtg 1 are described in Bowman et al., Ecotoxicology 
8:399-416, 1999; and Bowman et al., Mol. Cell Endocrinol. 196:67-77, 2002. Each real 
time PCR reaction consisted of 0.01 - 0.2 ug of reverse transcribed total RNA from liver 
tissue, IX universal Taqman master mix (ABI, Foster City, CA), and primers and probes 
in a 25 ul reaction. To generate a standard curve, varying amounts of plasmid containing 
the specific cDNA inserts for each gene were used as template in the PCR reactions. For 
each gene, a 6 point standard encompassing a 1 x 10 6 fold range of approximately 25 - 
2.5 x 10 6 copies of cDNA was constructed. Each sample was run in duplicate and 
normalized 18s rRNA, also obtained by real-time PCR. Both the intra-assay and inter- 
assay variability never exceeded 10%. The final data is graphed as the mean and 
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standard error of the relative copies of each ER or Vtg mRNA per ug of total RNA. 
Statistical differences between the treatments were determined by one way analysis of 
variance with Dunnets post-hoc analysis. 

Amplification of cDNA to be spotted on the macro arrays: The macroarrays were 
prepared and printed as previously described (Larkin et al., Marine Environ. Res. 54:395- 
399, 2002). Briefly, the 132 LMB clones were PCR amplified using primers specific to 
the M13 sequence of the cloning vector (pGEMT-Easy, Promega, Madison,WI). After 
completion of the PCR reactions the products were purified using MultiScreen PCR 
plates (Millipore, Bedford, MA), concentrated, denatured with NaOH, heated to 65°C for 
10 min, and then immediately quenched on ice. 20 X SSC (3M NaCl, 0.3M sodium 
citrate P H 7.0) containing 0.01 mM bromophenol blue was added to the samples to yield 
a final concentration of 0.3M NaOH, 6X SSC, and 100 ng/uL cDNA template. The PCR 
products were robotically spotted (Biomek 2000, Beckman Coulter, Fullerton, CA) in 
duplicate onto neutral nylon membranes (Fisher Scientific, Pittsburgh, PA) using 100 nL 
pins. Membranes were UV cross-linked and stored under vacuum at room temperature 
until hybridization. 

Labeling of RNA and hybridization was performed as described in Example 1. 
The membranes were exposed to a phosphor screen (Molecular Dynamics, Piscataway, 
NJ) at room temperature for 48 hours. The blots were quantitatively evaluated using a 
Typhoon 8600 imaging system (Molecular Dynamics, Piscataway, NJ). For each cDNA 
clone, the general background of each membrane was subtracted from the average value 
of the duplicate spots on the membrane. The values were normalized to the average 
value of 12 cDNA clones specific to ribosomal genes, which included S2, S3, S8, S15, 
S16 S27 L4,L5,L8,L13,L21,andL28. Ribosomal genes were chosen to normalize 
the data because they do not appear to fluctuate appreciably (< 1 .3 fold) in response to 
estrogenic compounds. Genes were not included for analysis that had values less than the 
background value for two out of the three replicates and/or fluctuated more then two fold 
when aliquots of the same RNA were hybridized to blots printed at the beginning, 
middle, and the end of the array printing process. 

Measurement of ER and Vtg 1 mRNA by real-time PCR: Real-time PCR is a 
sensitive assay that can be used to quantitate expression levels of genes. Using this 
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technology, assays were designed to quantitate the expression of 4 genes, estrogen 
receptors alpha, beta, and gamma, and Vtg 1 in LMB following exposure to 4-NP and 4- 
NP/ICI 182,780. Using primers and probes specific to each gene it was possible to 
differentiate between the ER isotypes with no cross reactivity. Exposure of LMB to a 
single IP injection of 4-NP (50 mg/kg) significantly increased ER a by 80 fold (p < 0.05) 
after 48 hours when compared to controls. During the same time frame, the levels of 
both ER P and ER y decreased approximately 1 .3-fold and 2.6-fold respectively, however 
these changes were not statistically significant from controls. When the LMB were 
exposed to a combination of 4-NP (50 mg/kg) and the anti-estrogen ICI 182,780 (1.0 
mg/kg), the levels of ER a increased only 4-fold over controls (p < .08), suggesting that 
the anti-estrogen had interfered with the activation process. As with the 4-NP treatment, 
the expression of ER 0 and y decreased (1 .9-fold) but the values did not differ 

significantly from controls. 

Since the Vtg gene is an E 2 -responsive gene that is under transcriptional control 
by ERs in the liver, the expression levels of Vtg 1 were also determined by real-time 
PCR. Exposure to 4-NP increased message levels by approximately 40-fold over 
controls (p < .05), however, this induction was not repressed by the addition of ICI 
182,780. 

LMB gene array analysis: In order to further characterize the effects of 4-NP 
alone or in conjunction with ICI 1 82,780 on hepatic gene regulation in LMB, the 
expression of 132 genes was examined, many of which are estrogen responsive, by gene 
arrays. Total hepatic RNA isolated from control and exposed fish was radiolabeled and 
hybridized to the membranes. Of the 132 genes on the array, only genes that changed by 
at least 3 standard deviations from the mean of the 12 ribosomal genes that were used to 
normalize the data are included. These include several that are up or down-regulated by 
more than 2-fold, a conservative cutoff generally used for array interpretation. The mean 
and standard error for each gene for control and treated LMB was determined. The fold 
induction of each gene over controls for both the NP and NP/ICI 182,780 treatments was 
determined. 

In the 4-NP-treated fish (Fig. 6), 9 genes were up-regulated 2-fold or greater 
including 4 Vtgs, choriogenin 2, choriogenin 3, aspartic protease, signal peptidase, and 
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one unidentified clone designated 92-1 . Two genes were found to be down-regulated by 
4-NP including transferrin and clone 50-1. In the case of the mixture of 4-NP and ICI 
182,780, some genes that were up-regulated by 4-NP treatment alone were reduced, but 
not all. In fact, the expression levels of 4 Vtgs, 2 choriogenins, and transferrin were not 
affected at all; instead they appear to be expressed to the same levels as with the 4-NP 
alone. Vtg 1, 2, 2a, and 3 were induced approximately 74, 28, 37, and 2-fold over 
controls respectively. The levels of both choriogenins increased to values approximately 
35-fold over controls while aspartic protease was induced 16 fold over controls. 

Genes which were reduced by the mixture and that exhibited at least a 2-fold 
change in expression included aspartic protease, protein disulfide isomerase, integral 
membrane protein, methionine sulfoxide reductase, ER y, glucocorticoid receptor, aldose 
reductase, ER p, FK506 binding protein, and 21 unidentified clones. All of these genes 
except for clone 53-1 were down regulated by the addition of ICI 182,780 to the 4-NP. 
Example 4 - Gene Expression Analysis of LMB Exposed to E 2 and p,p'-DDE Using a 

LMB Array Materials and Methods 
Amplification of cDNA to be spotted on the macro arrays: The 132 clones of 
LMB genes in pGEM-T Easy plasmids were PCR amplified in a 300 uL reaction 
containing IX PCR Buffer A (Promega, Madison,WI), 2mM MgCl 2 (Promega, Madison, 
WI), 160 uM each dNTP (Statagene, La Jolla, CA), 0.4 uM M13 primers (5'-GTT TTC 
CCA GTC ACG ACG TTG (SEQ ID NO:?) and 5'-GCG GAT AAC AAT TTC ACA 
CAG GA (SEQ ID NO:?)), and 1.25 units Taq polymerase (Promega, Madison, WI). 
The PCR reaction conditions were 1 cycle at 80°C (1 min), 1 cycle at 94°C (2min), 32 
cycles at 94°C (1 min), 57°C (1 min), and 72°C (2 min), 1 cycle at 72°C (10 min), and 
then hold at 4°C. After completion of the PCR reactions the products were purified using 
Multiscreen PCR plates (Millipore, Bedford, MA) and then concentrated in a speed-vac. 
Aliquots of the PCR products were run on a 1.2% agarose gel containing 0.3 mM 
ethidium bromide. The gels were digitally imaged using a UVP Bio Doc-It camera (Ultra 
violet Laboratory Products, Upland CA) and the concentration of each PCR product was 
determined by comparing the intensity of the gel band to a standard curve derived from a 
low DNA mass ladder (Invitrogen Corporation, Carlsbad, CA). The PCR products were 
adjusted to a concentration of 160 ng/uL cDNA template. 
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Spotting of the gene arrays and various controls used are described in Example 1. 
Chemicals, Treatment, and Preparation of the hepatic samples: E 2 (# E-8875) and p, p'- 
DDE (#12,389-7) were obtained from Sigma-Aldrich Corporation (St Louis, MO); 4-NP 
(#74430, 85% para isomer) was obtained from Fluka (Milwaukee, WI). 

Adult (~1 .5 year old) LMB weighing 300 ± 71 grams were obtained from 
American Sports Fish Hatchery (Montgomery, Alabama). Fish were acclimated for a 
minimum of one month in an aerated holding tank prior to treatment. The fish were 
exposed to ambient light and fed Purina Aquamax 5D05 fish chow (St. Louis, MO). 
Groups offish received a single IP dose of E 2 (2.5 mg/kg), 4-NP (50 mg/kg), or p, p'- 
DDE (100 mg/kg). E 2 and 4-NP were dissolved in lmL of 100% ethanol and then 
diluted to the appropriate concentration with DMSO (Sigma, St. Louis, MO # 5879), 
whereas p, p'-DDE was dissolved directly in DMSO. Control fish received an IP 
injection of the ethanol/DMSO or DMSO diluent without any chemical. During the 
experimental period the fish were not fed. 

The fish were euthanized 48 hours after the IP injection by addition of 50-100 
parts per million (ppm) of tricaine methanesulfonate (MS-222) to the water followed by a 
sharp blow to the head and cervical transection. The livers were excised from the fish 
and immediately flash frozen with liquid nitrogen. Total RNA was extracted from the 
tissue samples using RNeasy affinity columns (Qiagen, Chatsworth, CA). 

Labeling of RNA and hybridization was performed as described in Example 1. 
Detection and normalization was performed as described in Example 3. Transcript data 
were analyzed using linear regression and student t-tests (SigmaStat and SigmaPlot, 
Jandel, CA). 

Results 

Gene array technology has enabled researchers to analyze hundreds to thousands 
of genes on a single array. As a first step toward using array technology, the inter 
membrane variability between the gene arrays was determined. To accomplish this, 
aliquots of identical RNA samples were hybridized onto two separate membranes. A 
scatter plot correlating the intensity values for the cDNA clones between the two arrays 
was generated. The data points in the graph cluster along a slope of one starting with the 
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low to the high expressed cDNA clones (R2 of 0.98). Similar results were observed in a 

replicate experiment. 

In order to determine the specific expression profile of 132 unique genes in LMB 
exposed to E 2 , or to the contaminants 4-NP and p, p'-DDE, hepatic total RNAs from 
exposed fish were radiolabeled and individually hybridized to separate membranes. 
Three separate fish were used for each treatment. A graphical representation of this data 
is shown in Fig. 5. Figure 5A illustrates the mean ±SEM intensity values for each of the 
cDNA clones arranged in order of their expression; Fig. 5B illustrates the mean intensity 
values for each of the cDNA clones for E 2 divided by the mean intensity values of the 
respective cDNA clones from control fish. Only genes from any of the treatments (E 2 , 
NP or DDE) that were 3 standard deviations from the mean (0.98 ± 0.41) of the 12 r- 
protein genes that were used as constitutive controls are shown. While there are a 
number of genes whose expression levels meet this criterion, only genes that exhibit a 
two-fold or greater change in expression were considered to be differentially regulated. 
A two-fold cutoff is commonly used by researchers to demarcate up or-down regulated 
genes for array experiments (Nagahama, Y. Int. J. Dev. Biol. 38:217-229, 1994; Lin and 
Peter, Comp. Biochem. Physiol. B. Biochem. Mol. Biol. 129:543-550, 2001). Of the 132 
genes used on the array, 1 6 genes were up-regulated 2-fold or greater by E 2 including 
four Vtg genes, choriogenin 2, choriogenin 3, aspartic protease, protein disulfide 
isomerase, aldose reductase, and 7 unidentified clones designated 23-1, 24-1, 34-1, 92-1, 
101-1, 132-2, and 136-1. Two genes were down-regulated two-fold or more by E 2 
including transferrin and a clone designated 53-1. 

Since the mode of action of p, p'-DDE has not been extensively characterized, the 
influences of this compound on the expression profiles of the 132 genes arrayed in both 
male and female fish was examined. In male fish (Fig. 7), four genes were up-regulated 
by p, p'-DDE including Vtg 1, Vtg 2, choriogenin 2, and choriogenin 3, whereas one 
gene, clone 47-2 was down-regulated. In female fish (Fig. 8) injected with p, p'-DDE, no 
genes were identified as up-regulated; however, 17 genes were down-regulated two-fold 
or greater. These included the four Vtg 's, aspartic protease, transferrin, chemotaxin, 
choriogenin 2, androgen receptor, and 8 unidentified clones designated 50-1, 53-1, 71-1, 
101-1, 107-1, 118-1,120-1, and 128-1. 



{WP104247;2} 



37 



Summaries of the genes whose expression increased or decreases more than 2- 
fold for each exposure are depicted in Fig. 9. Light shading indicates down-regulated 
genes while dark shading indicates up-regulated genes. 

Example 5 - Altered Gene Expression In Liver of LMB Exposed To Androgens 

Methods 

Suppressive subtractive hybridization: Juvenile LMB were treated with a single 
50 pi intraperitoneal (IP) injection of either a 2 uM solution of dihydrotestosterone 
(DHT) or progesterone in DMSO (-2.5 nmol/g BWT). Fish were euthanized four days 
later and their livers were removed. Hepatic polyA+ RNA was isolated from these 
samples and subtractive hybridizations (Clontech, Palo Alto, CA) were performed in one 
direction using DMSO as the driver. The subtracted gene pools were then cloned into 
pGEM T-Easy (Promega, Madison, WI) and sequenced. Clones were identified using 
tBlastx at the National Center for Biotechnology Information (NCBI). 

Gene arrays: cDNAs obtained from SSH were arrayed as previously described 
(Larkin et al., Marine Environ. Res. 54:395-399, 2002) and then hybridized with 33P- 
labeled single-stranded cDNAs isolated from adult male LMB treated with 62.5 ug/g 
DHT or 20 ug/g 1 1-ketotestosterone (1 1-KT) or vehicle (DMSO) (n=5 per treatment). 
For each cDNA clone, the general background of each membrane was subtracted from 
the average value of the duplicate spots on the membrane. The values were then 
normalized to the average value of seven cDNA clones specific to ribosomal genes and 
the fold change calculated by dividing the mean of each treatment by the mean of the 
control. Those genes which changed by 2-fold or more were graphed. Significant 
differences (p<0.05) were determined by ANOV A and secondary testing was done by 
using Tukey's LSD. 

Results 

The results are shown in Table 1. Genes that were the most elevated include Vtg 
2, spermidine-spermine N 1 -acetyltransferase (SSAT), and ZPCs 1 and 4, while the LDL 
receptor, RXR interacting protein, and Vtg receptor were the most decreased. While the 
patterns of regulation appeared similar for both androgens, some specific differences did 
occur. For instance, aspartic protease and glutathione peroxidase III were up- and down- 
regulated, respectively, by DHT alone. Conversely, a fish homolog to pituitary tumor 
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transforming protein (PTTP) and cystatin were up- and down-regulated, respectively by 
1 1-KT alone. One gene that was up-regulated by both androgens was sSAT. This gene 
was shown to be unaffected by estradiol in the pig (Green et al., Biol. Reprod. 59:1251- 
1258, 1998). 

5 Table 1 Genes Up/Down Regulated By 1 1-KT and DHT 









TREATED WITH 


CHANGE 


LOG 


Gene ID 


E-score 


20MG/KG DHT 


BY11KT 


U I *T 


VTG PRECURSOR 


1.33E-39 


up 


up 


W 1 O 


SSAT 


0 


up 


up 


M7 
n r 


EST SEASONAL 64 


6.3 


up 


up 


r c. 


Q7-8 




up 


NC 


F12 


EST SEASONAL 88 


4 


up 


NC 


01 1 

w 1 1 


RIKEN 11 10001 M01 


2.61 E-05 


up 


up 


B14 


EST SEASONAL 62 


9.04 


up 


up 


LJ 1 o 


SOLUTE CARRIER 


2.29E-37 


up 


up 


F1? 


EST SEASONAL 56 


1.7 


up 


NC 


Gn 


RHAMNOSE BINDING LECTIN 


1.00E-43 


up 


NC 


vJ 1 O 


TFII1A 


1 .20E-30 


up 


NC 


H14 


ZPC1 


0 


up 


up 


U I o 


EST SEASONAL 9 


7.58 


up 


up 


114 


7PC4 


2.58E-25 


up 


up 


DQ 
DO 


EST SEASONAL 12 




up 


NC 


it 
1 1 


ASP PROT 




up 


NC 


ri4 


UNNAMED PROTEIN 


1 .48E-36 


up 


NC 


F1 


ATP AS E 6 


3.17E-18 


up 


NC 


FQ 


ATP AS E SUBUNIT 6 


5.13E-24 


up 


NC 


I VI I I 


RFTINOL DEHYDROGENASE 


9.07E-38 


up 


up 


l\/ 


ATP SYNTHASE 


1 .35E-08 


up 


NC 


r\ I o 


FSTP4 H07 
C_o I i *t nu/ 


2.16 


up 


up 




cot SEASONAL F21 


0.82 


down 


down 


ro 




6.92 


down 


NC 


I 0 


Al PHA 1 ANTITRYPSIN 


2.80E-27 


down 


NC 


D12 


RIKEN 2700038C09 


1.50E-04 


down 


NC 


M4 


EST SEASONAL 55 


1.32 


down 


down 


B2 


53-1 




down 


NC 


C2 


68-1 




down 


NC 


M5 


IGF-I 


7.50E-03 


down 


NC 


D6 


ESTP4 E06 


6.7 


down 


NC 


G9 


ESTP4 C04 


1.78 


down 


NC 


15 


HAPTOGLOBIN 


5.19E-28 


down 


NC 


K2 


ALDOLASE B 


0 


down 


NC 


C3 


APOLIPOPROTEIN E 


1.13E-26 


down 


NC 


K12 


EST SEASONAL 72 


6.57 


down 


down 


M2 


ALPHA TUBULIN 


2.93E-40 


down 


down 


G5 


GLUTATHIONE PEROXIDASE III 


0 


down 


NC 


K5 


ESTP4 E01 


3.43E+00 


down 


NC 
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L1 
L14 


24-1 

ER GAMMA 5' 2F 


2.49E-23 


down 
down 
down 


down 
down 
NC 


J5 
L3 


HEPCIDIN 
COMPLEMENT C3 


3 20E-04 


down 


down 


M12 


TFIID (change to liver regeration related 
protein) m „ — 


1 .88E-07 


down 


down 


K11 


EST SEASONAL 51 


0.39 


down 
down 


down 

Hnwn 

UUWI 1 


K1 
L11 


GP3 11C 

DVD IMTPRAPTINIf^ PROT 


1.36E-09 


down 


down 


M14 


\ /TO DO 




down 


down 


J10 


i r\i DO 
LDL KU 


1.62E-32 


down 


down 


L9 


COT OCAQHMAI QO 




down 
up 


NC 
NC 


N8 
010 


cot D/i nnft 
bo 1 r4 UUO 

DTTD 


3.12E-22 


up 


NC 


D4 


bo 1 un i D*t 


0.322 


up 


NC 


E14 


warm water acclim 


3.18E-12 


up 


up 


L8 


EST P4 06 




up 


MP 


H10 


EST SEASONAL 42 


0.011 


down 


NC 


B9 


EST SEASONAL F17 




down 


NC 


06 


EST SEASONAL 1 1 


0.36 


down 


NC 


03 


CYSTATIN 


8.39E-05 


down 


NC 



Example 6 - LMB And SHM Genes Up/down-regulated In Response 

Estrogenic Agents 

Table II LMB Gene Regulation 
5 



LMB# 


Gene ID 


Differentially 

expressed 

by 


LMB COMP FACTOR 
Bf/C2 


Putative complement factor 
Bf/C2 




LMB ABMP 


ABMP precursor 




LMB GLUT-PEROX III 


Glutathione peroxidase III 


Dn-reg DHT 


LMB_Srnp D1 


Small ribonucleoprotein D1 
polypeptide (16kD) 




LMB_RIBO L6 


Ribosomal protein L6 




LMBJVIYOSIN LIGHT 


myosin regulatory light 
chain 




LMB_ZPC1 


ZPC1 


up-reg 
DHT;11-KT 


LMB_CYTO-C OX 1 


Cytochrome c oxidase 
subunit I 




LMBJ.ECTIN STL2 


Rhamnose binding lectin 
STL2 


up-reg DHT 


LMB_EMAP2 


Echinoderm microtubule 
associated protein like 2 
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LMB ALDOLAbfc-B 


Aiooiase d 




LMB_RIBO L7A 


60S ribosomal protein L7A 




LMB PKOTHKOMdIin 


rroinronriDin precursor 




LMBSSAT 


CC AT 

ooA 1 


up-icy un I , 

11-KT 


LMB_COMPLEMENT- 


Complement C3 precursor 


Dn-reg DHT; 
11-KT 


LMB RIB0L7 


Ribosomal protein L7 




■ iin M A TO A OCT 

LM B_H- ATP AS fc- 

ci iri imit 


n+-A I rase suuunii, 
nlinnmvr'in ^pnsitivitv 

conferring protein 




LMB RIBOL23A 


Ribosomal protein L23a 




LMB_ALPHA-TUBULIN 


alpha tubulin 


Dn-reg 11 -kt; 
DHT 


LMB KIBO-Sa 


^•uo riDOSomai proiein oa 




LMB VTG 


Vitellogenin prcursor 




LMB NASCENT- 
POLYPEP 


Nascent polypeptide- 
associated complex, alpha 
polypeptide 




LMB_ApoH 


Apoliporotein H 




■ m a pj TOT DO 

LMBTBT-BP 


I b I -uinaing protein 




LMB_SOL-CAR-25A#5 


solute carrier family 25 

alnha mpmhpr *S 

CHLJI ICI HICII1UCI \J 


up-reg DHT; 
11-KT 


LMB_UNNAMED- 


Unnamed protein product 




LMB FIB-B-SUBUNIT 


Fibrinogen B subunit 




LMB_CIS-RETIN 


cis-retinoi oenyarogenase 


up-reg un i 


LMB_SENES-ASSOC 

PRHTPIM 
nr\Vj 1 CUM 


Putative senscence- 

aQ<5nriatprl nrotein 

ClOwvwIOlvU fJi Wlwll I 




1 h A O 1 Oil D^"» 

LMB LDL RC 


lul recepior 


Dn-rpn DHT* 
11-KT 


LMB ABC-TRANS 


ABC transporter 




LMB_CATHEPSIN B 


Cathepsin B 




LMB SERPIN-CP9 


Serpin CP9 




LMB_TFIIIA 


Transcription factor III A 
(TFIIIA) 




LMB_ANTITHROMBIN 

in 
111 


Antithrombin III 




LMB RIKEN 
1810056020 


RIKEN cDNA 1810056020 




LMB WEE-I 


Wee I tyrosine kinase 




LM B_H APTOGLOB I N 


Haptoglobin 


Dn-reg DHT 
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L IVI D_M r VJM- 1 


APOPI IPOPROTFIN A-l 




LMB ALPHA-1 
ANTITRYPSIN 


alpha -1 antitrypsin 
homolog precursor 


Dn-reg DHT 


1 IVJID ADAC 

LMD_ArUb 


Apoiipoproiein c 




LMB_ZPU4 




iirurpn DHT* 
up i c?y un i , 

11-KT 


LMB_LECTIN 9 


o-type lecun supenarTiiiy y 




1 KAD ATDAOC ft 

LMb A 1 rAot D 


a 1 rase suuuriii o 


i m-rpn l~}HT 


LMBJTI 


inter-alpha-trypsin inhibitor 
"ITI" 




LMB_EIF-3#7 


Eukaryotic translation 
initiation factor 3 subunit 7 




LMB HEPCIDIN 


Hepcidin precursor 


dn-reg DHT 


LMB_PTTP 


rituiiary lumor 
transform inn nrotein 




LMB_J0XIN-1 


Toxin-1 




LMB_COAG FACTOR 

V II 


Coagulation factor VII 




LMB_CDC42-2 


cdc 42 isoform 2 




LMB_W ARM-WATER 
ACC PROTEIN 


Warm water acclimation- 
related protein 


up-reg 11-KT 


LMB_CYTO-C OX II 


Cytochrome c oxidase 
subunit II 




LMB_L10A 


60S ribosomal protein 
L10A 




LMB KALLIKREIN 


Kallikrein 




LMB_DANIO EST 

OO IODOQ 


uanio bo i 
IMAGE'3818635 




■ yn A 1 DLJ A O 

LMB ALrHA-Z- 
MACROGLOB-1 


aipna-z-iTiacroyioiJUMi i- 1 




LMb MAr 1 UbLUB 
RELATED PROT 


napiogiODin-reidicu piuit;iii 




1 KAD CM AhACW D 

L M B_r 1 L AM h N - b 


riiamen d 




1 KAD 1 1 D IAI IITIM 

LMB UBIUUi 1 IN 


UDiquuin 




LMB_RXR INTERACT 

DDAT 


Retinoid X receptor 

infpmctinn nrntpin 
ii iici civ/iii ly piuidii 


Dn-reg 1 1 - 
KT; DHT 


LMB_MITOCHON- 

ATp QVMTWAC.F 


ATP synthase alpha chain 

mitnrhnnrlrial nrprur^or 

1 1 IHUlsl \\Jl IVJI IOI \J\ CVyUI Owl 


up-reg DHT 


LMB_TATA BOX BP 


TATA-box binding protein 




LMB DIFF-KEG 
TROUT PROT-1 


l"M If*-\ri^» r»f i»^llw rani ilot^/H trAi it 

ujiierentiaiiy reguiaiea iroui 
protein 1 




LMB LIVER-REGEN- 
REL PROT 


liver regeneration related 
protein 




LMB_SERPIN-2B 


Serpin 2b 




LMB_AP0-A1 


Apolipoprotein A-l-1 
precursor 
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LMB M-PHASE PROT 
6 


M-phase phosphoprotein 6 




LMB PROSTAGLAND- 
D-SYNTHASE 


Prostaglandin D synthase- 
like protein (lipocalin type) 




LMBJ.YRIC 


LYRIC 




LMB_CYSTATIN- 
PREC 


Cystatin precursor 


Dn-reg 11-KT 


LMB_RIKEN 2700038 


KllxtlN CUINA Zfuvvoo 




1 HAD RIA7CDAIM 

LM D_U 1 AZ. t r Alvl- 
BINDING INH1B 


IVloinui on It? doouuiaicu 

diazepam-binding inhibitor 




livid lor-i 


IVJl "I 




LIVID to I rn U I I 


F9TP4 D11 




i IV /ID CQTHUIT R 

LIVID to 1 Un 1 0 


CO 1 L/ll 1 u 




LMB ESTDHT 7 


ESTDHT 7 




LMB ESTDHT 13 


ESTDHT 13 




1 kiln rPTHUT CA 

LMB ESTDHT 5U 


to I uti i ou 




LMB_ESTDHT_51 


ESTDHT 51 




LMB ESTDHT 53 


to I Un 1 Do 




i tin roTrMJT 

LMB_ESTDHT_60 


to 1 Un 1 _JDU 




LMB_ESTDHT_62 


ESTDHT62 


up-reg DHT; 
11-KT 


LMB ESTDHT 68 


boTUn 1 bo 




LMB_ESTDHT_69 


rcTnuT cn 
bo 1 Un loy 




LMB_ESTP4_A02 


COTD/I A no 

bo 1 r4_AU£ 




1 HAD CCXD/1 DAQ 

LMb_to 1 r4_BUo 


CO 1 r *t DUO 




LMB_ESTP4_B04 


ESTP4_B04 




LMB ESTP4 B07 


ESTP4 B07 




LMB ESTP4 B08 


to 1 r4 duo 




LMB_ESTP4_B09 


ESTP4 B09 




LMB_ESTP4_C03 


ESTP4_O0o 




LMB_ESTP4_C04 


ESTP4_C04 




LMB_ESTP4_C06 


ESTP4_C06 




LMB ESTP4 D04 


ESTP4 D04 




LMB_ESTP4_D08 


ESTP4_D08 




LMB_ESTP4_D10 


ESTP4_D10 
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1 MR F°TP4 F01 


ESTP4 E01 




1 MR F°TP4 FO^ 

LIVID CO I i *t LUO 


FSTP4 E03 




LMB_ESTP4_E06 


ESTP4_E06 




1 MR FC.TP4 FOR 

LIVID CO 1 i *T LUO 


FSTP4 E08 




1 MR FCTP4 F19 
LIVID_Co I r^*f C I 


FSTP4 F19 




LMB_ESTP4_F06 


ESTP4_F06 




LMB_ESTP4_G06 


ESTP4_G06 




i md ccTD/i nil 
LIVID tolr4 ol I 


f<?tp4 mi 

CO 1 r *r O 1 1 




LMB_ESTP4_H02 


ESTP4_H02 




LMd_co 1 r4_nU4 


FQTP4 H04 
CO 1 r *t nUH 




LMB_ESTP4_H05 


ESTP4_H05 




LMB_ESTP4_H07 


ESTP4JH07 




LMB_ESTP4_H08 


ESTP4JH08 




i mr cot 
LMK Co 1 - 

SEASONAL_02 


F e T-°FA^nNAl 09 

CO 1 "OCnOUINnL 




1 RAD COT 

LMd to 1 - 

SEASONAL_03 


F^T-^FA^ONAI 0^ 

CO 1 "OCnOUINnL UO 




1 MR COT 

LMd col- 
SEASONAL_04 


POT CiFA^ONAI 04 

CO 1 "OlMOUINML U*t 




i mr cot 
LMd co I - 

SEASONAL_06 


F°T-°FA°ONAI 06 

CO 1 OC/AOUINML \J\J 




1 MR COT 

LMd col- 
SEASONAL_09 


FQT- 0 FA^nNAI 0Q 

CO 1 "OCnOUINnL UJ7 


i m-rpn DHT* 
11-KT 


LMB_EST- 


EST-SEAS0NALJ1 


dn-reg 11-KT 


LMB_EST- 

CFAQ,nNAI 19 


EST-SEAS0NALJ2 


up-reg DHT 


LMB_EST- 

°,FA°,ONAl -14 


EST-SEASONAL-14 




1 MR COT 

LMd co I - 
SEAS0NALJ6 


F c T-°FA c ONAl 16 

CO I "OCnOUINnL 1 U 




i mr cot 
LMd to 1 - 

SEAS0NAL_17 


F^T-^FA^DNAI 17 

CO 1 "OCnOUINnL 1 r 




LMB EST- 
SEASONAL_22 


EST-SEASONAL_22 




LMB EST- 
SEAS0NAL_51 


EST-SEAS0NAL_51 


dn 11-KT; 
DHT 


LMB EST- 
SEASONAL_52 


EST-SEASONAL_52 
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I MR FQT- 

SEASONAL_54 


F^T-^FAS-ONAl 54 




i mr fqt 

LIVID Co 1 

SEASONAL_55 


F^T-^FA^ONAI 55 


Dn-reo 11- 
KT; DHT 


1 MR COX 
LIVID to 1 - 

SEASONAL_56 


F^T-^FA^ONAI 5fi 
CO 1 "OCnOUINnL JO 


iin-rpn DHT* 


1 MR PQT-. 
LIVID CO 1 — 

SEASONAL_58 


AGONAL 58 




1 MR FQT— 
LIVID CO 1 — 

SEASONAL_59 


F^T-^FASONAL 59 




1 MR FQT 
LIVID CO I — 

SEAS0NAL_61 


F^T-9F AGONAL fi1 




I MR FQT 
LIVID CO 1 — 

SEASONAL_62 


F^T-^FA90NAl fi2 

CO 1 "OCnOVJINnL \JtL 




1 MR CQT 
LIVID CO 1 — 

SEASONAL_64 


F^T-^FA^ONAI fi4 

CO 1 OLnOUINnL u*+ 


i in-rpn 

DHT;11KT 


1 MR FQT 
LIVID CO 1 — 

SEASONAL_68 


F9T-9.FA9.ONAl 68 




1 MR FQT— 
LIVID CO 1 — 

SEASONAL_70 


F9T-9FA90NAL 70 
L\o 1 OLnuwi Nr\u / \j 




1 MR FQT 
LIVID CO 1 — 

SEASON AL_71 i 


F9T-9FA9.ni\JAI 71 

CO 1 OCnOV/INnL i l 




I MR FQT 
LIVID CO 1 — 

SEASONAL_72 


FQT-QFA90NAI 79 
CO 1 'OCnOUINnL / £. 


Dn-rpri DHT" 
11KT 


1 KMD COT 

LIVID CO 1 — 

SEASONAL_75 


FQT-QFA9HNAI 75 
CO 1 -OCnOUINnL 1 O 




I HD CQT 
LIVID CO 1 — 

SEASONAL_77 


F9T-9FA90NAI 77 

CO 1 "OCnOvlNnL 1 f 




i mr fqt 

LIVID CO 1 — 

SEASONAL_85 


F9T-9FA9DNAI 85 

CO 1 "OunOvlNnL OJ 




1 MR FQT 
LIVID CO 1 — 

SEASONAL_88 


F9T-9FA9nNAI 88 

CO 1 "OLnOvlNnL uu 


iin-rpo DHT' 


i mr fqt 

LIVID CO 1 — 

SEASONAL_90 


F9T-9FA90NAI Q0 

CO 1 "OCnOUINnL <j\J 


rin-rea DHT 


1 MR FQT 
LIVID CO 1 — 

SEASONAL_92 


F9T-9FA90NAI Q2 

CO 1 "OunOUINnL <j£- 




i mr fqt 

LIVID CO 1 — 

SEASONAL_97 


PQT QFA^HNAI Q7 

CO 1 -OCnOUINnL 57/ 




1 MR FQT 
LIVID CO 1 — 

SEAS0NAL_F1 1 


F9T-9FA90NAI F11 

CO 1 -OCnOUINnL- i 1 1 




I MQ CQT 
LMD CO I — 

SEAS0NAL_F17 


FQT QFAQOMAI F17 
CO I -OCnOUINnL r I / 


Dn-rpn 1 1-KT 

ui i i cy ii ix i 


1 MB EST— 

LIVID 1— O 1 

SEAS0NAL_F21 


EST-SEASONAL F21 


Dn-reg DHT 


LMB_ER-ALPHA 


ESTROGEN RECEPTOR 
ALPHA 


up-reg E2; 
NP 


LMB_ER-BETA 


ESTROGEN RECEPTOR 
BETA 
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LMB_ER-GAMMA 


ESTROGEN RECEPTOR 
GAMMA 


Dn-reg 11- 
KT; up-reg 
E2 


LMB_STAR 


STAR PROTEIN 


up-reg cAMP; 
dn-reg b- 
sitosterol 


LMB_SF1 


SF1 PROTEIN 
FRAGMENT 




LMB_ESTP4-E01 


LMB_ESTP4-E01 


down by DHT 


LMB_ESTDHT64 


LMB_ESTDHT64 


up by 11KT 


LMB LIV-REGER- 
PROT 


LMB_LIV-REGER-PROT 


down by DHT 


LMB_RIKEN 

a a a r\r\f\ a k ac\a 

1 1 10001 M01 


LMB_RIKEN 11 10001 M01 


up by 11KT; 
DHT 


I MR PQT 
LIVID Co I " 

SEASONALf 1 7 


1 MR CCT QPAQHMAI f 1 7 
LIvlD^to I -OtAokJINALil I 


aown uy 
11KT 


LMB1-3 


unknown 




LMB2-2 


unknown 




LMB3-1 


unknown 

Ul 11X1 1 




LMB4-1 


unknown 




LMB5 


vitellogenin-2A 


Up reg E2; 
NP; Dn-reg 
DDE(F) 


LMB6-1 


AM BP protein precursor 




LMB7-1 


Unknown 




LMB8-2 


Unknown 




LMB9-1 


Unknown 




LMB 10-1 


Unknown 




LMB 11-2 


Unknown 




LMB12-1 


Zebrafish Oligosaccharyl 
transfera^p intpciral 
membrane protein 




LMB 13-2 


Unknown 




LMB 14-1 


Unknown 




LMB 15-1 


NADH dehydrogenase 
subunit 1 




LMB 16-2 


unknown 




LMB 17-2 


Mitochondrial control region 




LMB 18-3 


unknown 
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LMB19-1 


insulin iiKe growin Tacior 




LMB20-1 


unknown 




LMB21-1 


unknown 




LMB22-1 


unknown 




I ft «noo A 

LMB23-1 


unknown 




LMB24-1 


Unknown 


Up-reg E2, 
dn-reg DHT; 
11-KT 


LMB25-1 


KiDosomai porotein oo 


• 


LIVIDZD- I 


Tr a n cf p rr i n 




I MD97 i 
LMDZ f - I 


Ul Iftl IUWI 1 




LMB28-2 


unknown 




LMB29-2 


unknown 




LMB30-1 


unknown 




LMB31 


choriogenin 




LMB32-1 


o-dox Dinaing Tacior 
fhacteria^ 




1 ft A D O O <4 

LMB33-1 


unknown 




LMB34-1 


un Known 


up icy 


LMB35-1 


unknown 




LMB36-1 


hypothetical protein 




1 HD07 4 

LMB37-1 


un Known 




LMB38-1 


40S ribosomal protein S2 




LMB39-1 


unknown 




LMB40-1 


alport syndrome chrom 
reoion oene 




LMB41-1 


ribosomal protein L8 
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LMB42-1 


Gamma fibrinogen 




LMB43-1 


FK506 binding protein, 
immunophillin 




1 KAO.AA 4 
LMD44- 1 


L/yiltilfl Heavy irlldlll 




LMB45-1 


vitellogenin A 




LMd4d-1 


unKnown 




LMB47-2 


unknown 


Down-reg 
DDE(M) 


LMB48-1 


elongation factor 1 beta 




LMB49-1 


40S ribosomal protein S15 




LMB50-1 


unknown 


Down reg 
NP; DDE(F) 


I MDK4 4 


unKnown 




LMB52-1 


unknown 




i nmco a 

LMB53-1 


unKnown 


uown-r eg. 
E2; DDE (F); 
DHT 


LMD04-2 


L4 riDOsomai 




LMB55-1 


L4 ribosomal 




LMB56-1 


40S ribosomal 




1 MOC7 

LMB57 


AUr.A i v iransiocase 




1 ft A O CO 4 

LMBoo-1 


riDosomai L/C \ 




1 MQCQ 4 


\jni\nowi 1 




LMdOU-1 


UllKilUVVN, Mr\U IUJJZ. 




1 hilDC'l A 

LMddI-1 


unknown 




LMB63-1 


unknown 




LMB64-1 


unknown 
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LMB65-1 


unknown 




LIVIDDD-l 


unKnown 




LMB67-1 


signal peptidase, 
endopeptidase 


Up-reg NP 


LMB68-1 


hypothetical protein 


Dn-reg DHT 


LMB69-2 


unknown 




1 K J07A O 

LMB70-2 


inaum aenyarogenase 

ci iKi mit 1 

oUUUI III 1 




LMB71-1 


unknown 




down-reg 
DDE (F) 


LIvId72-1 


unknown 




LMB73-1 


NADH dehydrogenase 

cr iKi in it 1 
oUUUIIIl I 




LIVID 1 H~0 


i inknn\A/n 

Ul IF\I IUWI 1 




LIVID / O I 


rihnQnmal 
*-rV/o i iuuoui i iai 




I MR7R-9 

LIVID r U t 


i inknnwn 

ui i r\ i iuvv 1 1 




I MR77-1 

LIVID r / - 1 


i inknnwn 
ui i rvi iuwi i 




LMB78-1 


unknown 




LMB79-2 


unknown 




LMB80-1 


unknown 




LMB81-1 


unknown 




LMB82-1 


unknown 




i a moo 

LMB83 


vitellogenin-2 


up-reg tz, 
NP- DDEfMV 
dn-reg 
DDE(F) 


LMB84-1 


STAR 




LMB85-1 


CYP1A 




LMB86-1 


ribosomal protein L28 




1 MDQ7 
LMDOf 


vitellogenin- i 


up icy l^, 

NP; DDE(M); 

dn-reg 

DDE(F) 


LMB88-1 


unknown 
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LMB89-1 


glucocorticoid receptor 




LMB90-1 


unknown 




I MRCM 1 
LMDy I - I 


Ul IUWI 1 




LMB92-1 


unknown 


up-reg E2;NP 


LMB93-1 


estrogen receptor gamma 




LMB94-1 


transferrin 


Down-reg. 
E2;NP; DDE 
in F 


LMB95-1 


CAP-rich Zinc finger protein 




LMB96-1 


unknown 




LMB97 


choriogenin-3 


Up-reg E2; 
NP* DDEfMV 
DHT 


Livibyo-i 


rtotrftrt^n r c\e* ar\\ f\r Koto 

esxrogen receptor Deid 




LMB99-1 


estrogen receptor alpha 




1 A A O A A A A 

LMB100-1 


ribosomal protein L5 




LMB101-1 


unknown 


up-reg c^, 
Dn for 
DDE(F) 


LMB102-1 


chemotaxin 


down-reg 
DDE (F) 


LMB103-1 


proteosome subunit 9 




LMB 104-3 


buo riDosomai proiem lio 




LMB105-1 


unknown 




LIVID \ 


univnuwii 


Hn\A/n-rpn 
DDE (F) 
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LMB 108-1 


choriogenin-2 


Up-reg E2; 
NP; DDE(M); 
Dn-reg DDE 
(F) 


LMB109-1 


40S ribosomal protein S3A 




LMB110-1 


Methionine sulfoxide 
reductase 




LMB112-1 


cathepsin (Aspartic 
protease) 


■ i ft ron C» KID' 

up-reg c, iNr. 
DHT; Dn- 
DDE (F) 


LMB116-1 


aldose reductase 


up-reg E2 


LMB118-1 


apolipoprotein precursor 


uown-reg 
DDE (F) 


LMB120-1 


hypothetical protein 


aonw-reg 
DDE (F) 


LMB121-1 


TBT binding protein 




LMB122-1 


alpha2-HS glycoprotein 




1 ft M t~% A *~» O A 

LMB123-1 


urocanase 




LMB128-1 


unknown 


down-reg 
DDE (F) 


LMB129-1 


unknown 




LMB130-1 


secreted phosphoprotein 
precursor 




LIVID I I 


intparin beta 


up-req E2 


LMB133-1 


unknown 




LMB134-3 


unknown 
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LMB135 


protein disulfide isomerase 


up-reg E2 


LMB136-1 


protein disulfide isomerase 
like 


I in r CO 

up-reg 


LMB137-2 


unknown 




LMB138-1 


unknown 




LMB139-1 


apolipoprotein C2 




LMB140-1 


unknown 




LMB141 


vitellogenin-3 


up reg E2; 
NP, dn DDE 
(F) 


LMB142-1 


hypothetical protein 

/ci nnc:Qn\ 




LMB144-1 


vitellogenin like 




LMB150 


androgen receptor 


Dn-reg 
DDE(F) 


LMB151 


vitellogenin receptor 


Dn-reg DHt- 
11-KT 



Table III SHM Gene Regulation 











Clone ID 


Identity 


E value 


Regulation 


SHM IK 7A 


40 S ribosomal protein S17 (Ictalurus 
punctatus) 


2.00 E-39 




SHM IK24E 


similar to ribosomal protein L37a, cytosolic 


4.00E-34 




SHM IK 25C 


ribosomal protein L5 


5 E-05 




SHM IK 5D 


60 S ribosomal protein L8 


5.00E-26 




SHM IKIGF-1 


IGF I 






SHM IKIGF-2 


IGF 2 






Female Test 
(SSH) 








ndSHM-FT1 -A03 


sertotransferrin precursor (0. Latipes) 


1.00E-99 




ndSHM-FT1-A09 


putative transmembrane 4 superfamily 
member protein 


7.88E-08 


up-reg- E2 
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ndSHM-FT1 -A10 


unknown 




up-reg-E2 


ndSHM-FT1-A11 


phospholipid hydroperoxide glutathione 
peroxidase 


5.61 E-44 


dn-reg E2 


ndSHM-FT1-A12 


sertotransferrin precursor (0. Latipes) 


3.90E-35 


dn-reg E2 


ndSHM-FT1 -B03 


Unknown 






ndSHM-FT1-B07 


Similar to aldehyde dehydrogenase 7 family, 
member A1 


0 




ndSHM-FT1-B10 


cytochrome b [Orestias silustani] 


0 




ndSHM-FT1-C01 


Similar to high mobility group box 1 [Danio 
rerio] 


0 




_ _i o i_] R /I rrT4 PA'S 

noSHM-FTi -OUo 


perronn i (pore Torming proiein^ nunidri,, 


dL.KJKJ 1 — 1 C7 


i in rpn F2 


ndSHM-FT1 -C04 


rrostagianain u oynmase [Aenopus laevisj 


1 .\J 1 L U J 


rin-rpn F9 
\j\ i i cy i — 


ndSHM-FT1-C09 


endoplasmic reticulum lumenal L-amino acid 
oxidase 


0 


dn-reg E2 


ndSHM-FT1-D0o 


Unknown 




i in-rpn F? 
u p i cy l.<c 


ndSHIvl-FT1-D1U 


unknown 




un-rpn F2 

up i cy I— 


ndSHM-FT1-D12 


unknown 




Hn-rpn F9 


ndSnM-rT 1-fcU 1 


prooaDie complement regulatory pidbiiia 
protein SB1 - 


/1 . 1 C7 L_ VJC 


rin-rpa E2 


ndSHM-rT1 -bOfti 


oyiocnrome o oxiaase suounu it 


? noF-fi? 

Z. .UU L_ \J£- 




ndSHM-FT1-E08 


unknown 




up-reg E2 


ndSHM-FT1 -E09 


unknown 




up-reg E2 


ndSHM-FT1-E12 


Similar to chitinase, (D. rerio) 


1.00E-83 


dn-reg E2 


ndSHM-FT1-F01 


leucine-rich alpha-2-glycoprotein [Homo 
sapiens] 


3.29E-13 




ndSHM-FT1 -F06 


complement component C3 [Paralichthys 
olivaceus] 


0 


dn-reg E2 


ndSHM-FT1-r09 


solute carrier tamiiy zf ijauy acia 

tranQnnrtpr^ tnpmhpr 

LI al lopV-JI LCI 1 1 ICI 1 IL/CI 


D, 1 OL_ 1 




ndSHM-FT1-F10 


beta hemoglobin A [Seriola quinqueradiata] 


1.00E-42 


dn-reg E2 


ndSHM-FTI-r 1 1 


unknown 




Hn-rpn F? 
ui i i cy \—£- 


ndSHM-FT1-F12 






up reg E2 


ndSHM-FT1-G02 


unknown 






ndSHM-FT1-G04 


Unknown 


2.15847 


up-reg E2 


ndSHM-FT1-G08 


endoplasmic reticulum lumenal L-amino acid . 
oxidase 


0 




ndSHM-FT1 -H02 


FUGRU complement component C9 
precursor 


3.00E-16 




_ jai it i f— ■ ~f* MAO 

ndSHM-FT1-H03 


oo kua serum lectin [Aenopus laevisj 






ndSHM-FT1-H04 


oimiiar to cnitinase, aciaiu \u. reno; 




Hn-rpn FP 
\ji i i cy i— £- 


ndSHM-FT1-H0o 


oHi--*: serine protease inniDitor (aa i-^uf ; 

[rxdLlUo 1 IU 


1 Q7F-0Q 




ndSHM-FT1-H07 

1 IIVI 1 1 1 I IV/ 1 


unknown 




up-reg E2 


ndSHM-FT1-H10 

i iuui iivi i tiii i \j 


Unknown 






nriSHM-FT1-H1 1 

1 IUOI IIVI 1 II 1 1 1 1 


unknown 




up-reg E2 


ndSHM-FT1-H12 


beta hemoglobin A 


1.40E-45 


up-reg E2 


ndSHM-MC1-A02 


Liver basic fatty acid bp 


2.00E-43 


dn-reg E2 


ndSHM-MC1-A03 


Polyadenylate-binding protein 1 


0 




ndSHM-MC1-A04 


unknown 




up-reg E2 


ndSHM-MC1-A05 


beta galactosidase/ubiquitin fusion protein 


3.00E-44 




ndSHM-MC1-A07 


Orla C3 (O. latipes) 


9.00E-38 
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nuoniVMviVM» i-MUi? 


fllnha-9-marrnnlohulin 2 (C caroio) 

CM \j\ la t 1 1 lavi uyivjuum i £- y w . vui ^yivyy 


2.00E-06 


dn-reg E2 ' 


ndSHM-MC1-A11 


alpha-1 -antitrypsin [Sphenodon punctatus] 


1.27E-11 


dn-reg E2 


ndSHM-MC1-B01 


unknown 




an-reg cz 


ndSHM-MC1-B03 


cytochrome c oxidase, subunit Va 


0 




ndSHM-MC1-B04 


KIAA0018 protein [Homo sapiens] 


9.29E-35 


up-reg E2 


ndSHM-MC1-B05 


Unknown j 




up-reg E2 


ndSHM-MC1-B08 


complement component C5-1 [Cyprinus 
carpio] 


5.61 E-30 




ndSHM-MC1-B10 


Serotransferrin precursor 
>gi|1 814091 |dbj|BAA1 0901 .1 1 


2.00E-39 




ndSHM-MC1-B11 


fibrinogen, B beta polypeptide 


o oner At^ 


r\r\ ran CO 

on-reg 


ndSHM-MC1-C02 


Similar to fibrinogen, gamma polypeptide 
[Danio rerio] 


4.06E-41 


up-reg-field 








an-reg 


ndSHM-MC1-C04 


4-hydroxyphenylpyruvate-dioxygenase 


U 




ndSHM-MC1-C05 


unknown 




i i r\ ron CO 

up-reg \Zc. 


ndSHM-MC1-C08 \ 


serine proteinase inhibitor CP9 - common 
carp 


8.08E-39 


dn-reg E2 


ndSHM-MC1-C10 


prothrombin precursor [Takifugu rubripes] 


0 




ndSHM-MC1-D01 








ndSHM-MC1-D02 


ATPase, H+ transporting, lysosomal, 


1 .47E-25 




ndSHM-MC1-D03 


fatty acid binding protein 2, hepatic 
(Japanese seapearch) 


2.00E-58 


up-reg-field 


ndSHM-MC1-D04 


Proteasome Regulatory Particle, ATPase- 
like 


0 




ndSHM-MC1-D06 


expressed sequence AL022852 [Mus 
musculus] 


1 .63E-21 


up-reg E2 


ndSHM-MC1-D10 


Scavenger receptor with C/type lectine type I 
^numanj 


nop iA 


r\ r\ ran CO 

an-reg tz^. 


naoriivi-ivn-» i-cu i 


cimilar tn mnnomrhriYvlatp trfln<>nortfir 6 


2.73E-17 




ndSHM-MC1-E05 


elastase 4 precursor [Paralichthys 


0 


up-reg field 


ndSHM-MC1-E06 


Unknown 






ndSHM-MC1-E08 


pre alpha inhibitor heavy chain 3 rat 


3.0UE-14 


on-reg tz 


ndSHM-MC1-E10 


Unknown 




up-reg tz 


ndSHM-MC1-E12 


unknown 




up-reg 


ndSHM-MC1-F01 


similar to charged amino acid rich leucine 
zipper factor- 1 


1.83E-11 


up-reg E2 


ndSHM-MC1-F02 


chemotaxis (O. mykiss) 


2.00E-60 




ndSHM-MC1-F03 


dendritic cell protein [Homo sapiens] 


0 




ndSHM-MC1-F06 


Chain A, Alcohol Dehydrogenase 


0 




ndSHM-MC1-F11 


17-beta-hydroxysteroid dehydrogenase type 
IV 


0 


up-reg E2 


ndSHM-MC1-F12 


interferon induced protein 2 [Ictalurus 
punctatus] 


2.49E-07 


up-reg E2 


ndSHM-MC1-G01 


Alcohol dehydrogenase >gi|482344| 


0 


up-reg 


ndbnM-lviOi -oUi 


<i a \y r> «-« onrklinnnrAtain TAnni tills i r^f^in iV^sl 

I4KU3 apoiipoproiein i/Anguiiid jafjumucaj 


1.45E-16 


dn-rea E2 


ndSHM-MC1-G03 


serine (or cysteine) proteinase inhibitor, 
clade F 


1.14E-29 


dn-reg E2 


ndSHM-MC1-G04 


ribosomal protein XL1a - African clawed frog 


0 




ndSHM-MC1-G05 


microfibrillar-associated protein 4 


3.27E-13 




ndSHM-MC1-G07 


apolipoprotein E [Scophthalmus maximus] 


4.21 E-37 
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ndSHM-MC1-G11 


aldehyde reductase AFAR2 subunit [Rattus 
norvegicus] 


3.34E-36 


up-reg E2 


ndSHM-MC1-G12 


Similar to RIKEN cDNA 1 30001 8K1 1 gene 
THorno sapiens] 


4.21 E-12 




ndSHM-MC1-H02 


unknown 






ndSHM-MC1-H03 


complement factor B/C2B (0. mykiss) 


8.00E-28 




ndSHM-MC1-H04 


similar to ribosomal protein S25, cytosolic 
[validated] - 


1.07E-23 


up-reg field 


ndSHM-MC1-H06 


unnamed protein product [Homo sapiens] 


0.000421546 


up-reg E2 


ndSHM-MC1-H08 


peroxisomal proliferator-activated receptor 
betal [Salmo salar] 


2.00E-08 




ndSHM-MC1-H09 


Ligand-gated ionic channel family member 


i .yo i oo 


UyJ icy 


ndSHM-MC1-H10 i 


unknown 


3.75692 


up-reg E2 


ndSHM-MC1-H12 


Similar to sperm associated antigen 7 [Homo 
sapiens] 


4.36E-18 




Male Test SSH 








ndSHM-MT1-A02 


chicken fatty acid binding protein 


3.00E-52 


up-reg E2 


ndSHM-MT1-A03 


warm temperature acclimation related 65 ! 
kDa protein (0. latipes) 


6.00E-40 




ndSHM-MT1 -A05 


Transducin beta/like 2 protein 


e-107 


up-reg E2 


ndSHM-MT1-B09 


putative mitochonrial inner membrane 
protease subunit (Human) 


1 .00E-34 


up-reg E2 


ndSHM-MT1-C05 


unknown 




i m-rpfi F? 


ndSHM-MT1-C08 


vitellogenin I [Cyprinodon variegatus] 




i m-rpfi F9 


ndSHM-MT1-D04 


WS beta-transducin repeats protein [Homo 
sapiens] 


1.87E-05 




ndSHM-MT1-D05 


mesau serum amyloid A/3 protein precursor 




i m-r£>n FP 
u\j i t?y 


ndSHM-MT1-D07 


vitellogenin (Sillago japonica) 


8.00E-78 


up-reg E2 


ndSHM-MT1-E02 


40 S ribosomal protein S3 


E-105 




ndSHM-MT1-E03 


Similar to transducin (beta)-like 2 [Xenopus 
laevisl 


0 


up-reg E2 


ndSHM-MT1-E05 


Predicted CDS, seven TM Receptor S 


3.00782 


up-reg E2 


ndSHM-MT1-F11 


Similar to transducin (beta)-like 2 [Xenopus 
laevis] 


1.42E-16 




ndSHM-MT1-G03 


60S ribosomal protein L10a >g 


2.40E-38 




ndSHM-MT1-H05 


Similar to transducin (beta)-like 2 [Xenopus 
laevis] 


0 




METHOXYCHLOR 
-CONTROL SSH 








ndSHM-MXCd- 
A04 


Protein involved in recombination repair, 
homoloaous to S. pombe rad18. 


0.0928205 


dn-reg E2 


ndSHM-MXCd- 
A09 


no hit 






ndSHM-MXCd- 
A10 


KIAA0096 gene product is related to a 
protein kinase. 


5.62E-12 


up-reg E2 


ndSHM-MXCd- 
A11 


alpha s HS glycogrotein (Platichthys flesus) 


1.00E-47 




ndSHM-MXCd- 
B02 


dodecenoyl-Coenzyme A delta isomerase 


3.82E-37 


up-reg E2 


ndSHM-MXCd- 
B03 


cytochrome P450 3A56 [Fundulus 
heteroclitus] 


0 


up-reg field 


ndSHM-MXCd- 
B04 


kallistatin [Rattus norvegicus] 


4.93156 


up-reg E2 
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ndSHM-MXCd- 
B06 


Fibrinogen beta chain precursor [Contains: 
Fibrinopeptide B] 


5.78E-24 


dn-reg E2 


ndSHM-MXCd- 
B07 


Apolipoprotein A/I precursor (sparus aurata) 


5.00E-25 


dn-reg E2 


ndSHM-MXCd- 
B08 


Similar to retinol dehydrogenase type ill 
[Danio rerio] 


4.16E-32 




ndSHM-MXCd- 
C04 


Beta-2-glycoprotein I precursor 
(Apolipoprotein H) ( 


1.92E-11 




ndSHM-MXCd- 
C06 


tyrosine kinase [Gallus gallus] 


1.31312 




ndSHM-MXCd- 
C11 


ceruloplasmin [Danio rerio] 


0 


up-reg field 


ndSHM-MXCd- 
D03 


vitellogenin I precursor (Mummichog) 


4.00E-51 


up-reg E2 


ndSHM-MXCd- 
D04 


hypothetical protein [Ferroplasma 
acidarmanus] 


0.826071 


dn-reg E2 


ndSHM-MXCd- 
D05 


cytochrome c oxidase subunit I [Engraulis 
japonicus] 


8.28E-35 


dn-reg E2 


ndSHM-MXCd- 
D08 


no hit 




up-reg E2 


ndSHM-MXCd- 
D10 


Immunoglobulin domain-containing protein 
family 


0.991091 


up-reg E2 


ndSHM-MXCd- 
D12 


hypothetical protein [Plasmodium falciparum 
3D7] 


8.1324 




ndSHM-MXCd- 
E01 


hypothetical protein [Magnetospirillum 
magnetotacticum] 


0.61028 




ndSHM-MXCd- 
E09 


unknown protein 




up-reg E2 


ndSHM-MXCd- 
E11 


sorting nexin 1 1 [Homo sapiens] 


0 




ndSHM-MXCd- 
F01 


vitellogenin B (M. aeglefinus) 


6.00E-16 


up-reg E2 


ndSHM-MXCd- 
F03 


warm-temperature-acclimation-related- 
protein- [Oryzias latipes] 


6.25E-25 


dn-reg E2 


ndSHM-MXCd- 
F07 


UDP-glucose pyrophosphorylase [Gallus 
gallus] 

W A 


0 




ndSHM-MXCd- 
F10 


interferon-related developmental regulator 1 
[Mus musculus] 


6.68E-39 


up-reg E2 


ndSHM-MXCd- 
G02 


unknown protein 


0.202018 


up-reg E2 


ndSHM-MXCd- 
G03 


thyroid hormone receptor interactor 12; 


0 


up-reg E2 


ndSHM-MXCd- 
G04 


Putative ribosomal protein L21 


0 




ndSHM-MXCd- 
G12 


putative delata 6-desaturase [Oncorhynchus 
masou] 


0 


up-reg E2 


ndSHM-MXCd- 
H05 


complement control protein factor l-A 
[Cyprinus carpio] 


1.72E-23 




ndSHM-MXCd- 
H09 


ATP synthase 6 


3.00E-23 




METHOXYCHLOR 
TEST SSH 








ndSHM-MXCtl- 
B05 


rat liver regeneration related protein 


1.00E-48 
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ndSHM-MXCU- 
B08 


BH2041 -unknown conserved protein 
[Bacillus halodurans] 


6.52356 


up-reg E2 


ndSHM-MXCH- 
C02 


lysophospholipase (Rat) 


1.00E-36 


up-reg E2 


ndSHM-MXCtl- 
C11 


Unknown protein for MGC:63946 (D. rerio) 


3.00E-29 




ndSHM-MXCU- 
D09 


unknown 




up-reg E2 


ndSHM-MXCtl- 
E04 


unknown 




up-reg E2 


ndSHM-MXCtl- 
E06 


CG4198-PA [Drosophila melanogaster] 


0.385852 




ndSHM-MXCH- 
E09 


PROBABLE IRON OXIDASE PRECURSOR 
OXIDOREDUCTASE PROTEIN 


3.15365 




ndSHM-MXCtl- 
E12 


Vitellogenin I precursor (VTG I) [Contains: 
Lipovitellin 1 ( 


0 


up-reg E2 


ndSHM-MXCtl- 
F11 


Unknown 




up-reg E2 


ndSHM-MXCtl- 
G03 


Unknown 






ndSHM-MXCtl- 
H03 


miro2 pending protein 


4.00E-60 




ndSHM-MXCtl- 
H09 


Group XIII secretory phospholipase A2 
precursor 


6.05E-40 


up-reg E2 


NONYLPHENOL 
CONTROL SSH 








ndSHM-NPd-A12 


unknown 






ndSHM-NPc1-B01 


NADH subunit 1 [Cyprinodon variegatus] 


2.80E-45 


up-reg field 


ndSHM-NPd-B08 


Chain A, Complex Of The Catalytic Portion 
Of Human 


1.20E-15 


up-reg field 


ndSHM-NPc1-B09 


calreticulin [Danio rerio] >gi|6470259|gb| 


0 




ndSHM-NPc1-C04 


hypothetical protein APE0566 - 


0.667761 




ndSHM-NPc1-C06 


Vitellogenin II precursor (VTG II) [Fundulus 
heteroclitus] 


0 


up-reg E2 


ndSHM-NPd-C11 


translation elongation factor 1 -alpha 
fStylonychia mytilus] 


7.14E-10 




ndSHM-NPc1-E01 


vitellogenin I [Cyprinodon variegatus] 


•i one 0.0 


up-reg 


ndSHM-NPd-E06 


Unknown 






ndSHM-NPd-E11 


Unknown 




up-reg 


ndSHM-NPc1-F01 


ubiquitin A-52 residue ribosomal protein 
[Homo sapiens] 


o nnc o7 




nOonlvi-NrCT -rUD 


VllcllUycl ill 1 r\ [IVIdcll 1Uy 1 ai 1 II 1 lUo acyidii iuoj 


0 000293022 


UD-rea E2 


ndSHM-NPc1-F06 


Unknown 




up-reg E2 


ndSHM-NPc1-F07 


LFA-3(delta TM) [Ovis sp.] 


{j.yjfooZZo 


up-reg 


ndSHM-NPc1-F08 


CG32659-PA [Drosophila melanogaster] 


0.0316684 




ndSHM-NPc1-G02 


ribophorin 1 [Danio rerio] 


0 




ndSHM-NPc1-G08 


KIAA1560 protein [Homo sapiens] 


6.27E-38 




ndSHM-NPd-G11 


ATP synthase alpha chain, mitochondrial 
precursor 


1.29E-23 




ndSHM-NPc1-H01 


similar to Tho2 [Homo sapiens] [Rattus 
norvegicus] 


2.32887 




ndSHM-NPc1-H02 


Transporter, truncation [Streptococcus 
pneumoniae R6] 


5.24069 


up-reg E2 



{WP104247;2} 



57 



ndSHM-NPd-H03 


Hemoglobin beta chain 
>qi|7439519|pir||S70614 


1.2944 




ndSHM-NPd-H04 


unknown 




up-reg E2 


ndSHM-NPc1-H05 


Cytochrome c >gi|65467|pir||C 


3.47E-32 




ndSHM-NPc1-H08 


choriogenin L (0. latipes) 


1.00E-70 


up-reg E2 


NONYLPHENOL 
TEST SSH 








ndSHM-NPt1-A01 


RIFIN [Plasmodium falciparum 3D7] 
>qi|23498329|e 


1.79528 




ndSHM-NPt1-A02 


P0699H05.18 [Oryza sativa (japonica 
cultivar-group)] 


0.244655 




ndSHM-NPt1-A03 


hypothetical aminotransferase 
[Bradyrhizobium japonicum] 


0.421189 




ndSHM-NPt1-A04 


unknown 




up-reg E2 


ndSHM-NPt1-A05 


serum amyloid A protein [Holothuria 
glaberrima] 


9.34E-14 


up-reg E2 


ndSHM-NPt1-A08 


unknwon 




up-reg E2 


ndSHM-NPt1-A09 


DNAse II homolog F09G8.2 [Caenorhabditis 
eleqansl 


0.656008 


up-reg E2 


ndSHM-NPt1-B02 


similar to peroxisomal long-chain acyl-coA 
thioesterase; peroxisomal long-chain acyl- 
coA thioesterase ; putative protein [Homo 
sapiensj 


2.30E-17 


up-reg E2 


nQonivi-iN"i i -duo 


rhorinnpnin Hminnr TOrvzias latioesl 


1.52E-14 


up-reg E2 


ndSHM-NPt1-B05 


tryptophan 2,3 dioxygenase 


1.00E-60 


up-reg E2 


ndSHM-NPt1-B06 


ATP synthase 6 (Pomacentrus trilineatus) 


9 nnp 9^ 




ndSHM-NPt1-B07 


unknown 




up-reg E2 


ndSHM-NPt1-B11 


embyonic epidermal lectin (X. laevis) 


4.00E-42 


up-reg E2 


ndSHM-NPt1-B12 


perlecan (heparan sulfate proteoggllycan 2 


2.00E-31 




ndSHM-NPt1-C01 


immunoglobulin light chain [Seriola 
quinqueradiata] 


1.38E-14 


up-reg E2 


ndSHM-NPt1-C03 


cytochrome c oxidase subunit 1 [Arcos sp. 
KU-149] >gi|25006169|dbj|BAC23776.1 1 
cytochrome c oxidase subunit I [Arcos sp. 
KU-1491 


0 


up-reg E2 


ndSHM-NPt1-C05 


C9 protein [Oncorhynchus mykiss] 


8.96E-18 




ndSHM-NPt1-C06 


pentraxin [Cyprinus carpio] 


9.55E-15 


up-reg E2 


ndSHM-NPt1-C09 


Very-long-chain acyl-CoA synthetase (Very- 
long-chain-fatty-acid-CoA ligase) 
>gi|2645721 |gb|AAB87982.1 1 very-long- 
chain acyl-CoA synthetase [Mus musculus] 


1.09E-13 


up-reg E2 


ndSHM-NPt1-C12 


dihydroorotate dehydrogenase electron 
transfer subunit [Clostridium tetani E88] 
>gi|2820441 5|gb|AA036853.1 1 
dihydroorotate dehydrogenase electron 
transfer subunit [Clostridium tetani E88] 


5.60255 


up-reg E2 


ndSHM-NPt1-D04 


hypothetical protein [Plasmodium yoelii 
yoeliil 


1.69055 




ndSHM-NPt1-D05 


Deoxyribonuclease II precursor (DNase II) 
(Acid DNase) (Lysosomal DNase II) 
>gi|751 3450|pir|| JE0205 deoxyribonuclease 
II (EC 3.1.22.1) -pig 
>qi|31 57444|emb|CAA0471 7.1 1 


3.09E-16 
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Deoxyribonuclease II [Sus scrofa] 
>gi|33091 53|gb|AAC39263.1 1 
deoxyribonuclease II fSus scrofa] 






ndSHM-NPt1-D07 


egg envelope protein winter flounder 


4.00E-41 


up-reg E2 


ndSHM-NPt1-D07 


similar to olfactory receptor MOR149-1 [Mus 
musculus] 


4.09975 


dn-reg E2 ! 


ndSHM-NPt1-D09 


CG31752-PA [Drosophila melanogaster] 
>gi|22946779|gb|AAN1 1014.1 |AE003660_32 
CG31 752-PA [Drosophila melanogaster] | 


1.73966 




ndSHM-NPt1-D11 


Fibrinogen alpha (Rattus) 


5.00E-05 


up-reg field 


ndSHM-NPt1-E02 


heparin cof actor II [Danio rerio] 


0 




ndSHM-NPt1-E03 


FIFO-type ATP synthase subunit g [Homo 
sapiens] 


3.32E-22 


up-reg E2 


ndSHM-NPt1-E06 


unknown 






ndSHM-NPt1-E07 


hypothetical protein XP_215519 [Rattus 
norvegicus] 


5.42E-09 




ndSHM-NPt1-E12 


6.2 kd protein [Homo sapiens] 
>gi|12643829|sp|Q9P0U1|OM07_HUMAN 
Probable mitochondrial import receptor 
subunit TOM7 homolog (Translocase of 
outer membrane 7 kDa subunit homolog) 
(Protein AD-014) 

>gi|7688665|gb|AAF67473.1 |AF150733_1 
AD-014 protein [Homo sapiens] 
>gi|1 280461 9|gb|AAH01 732.1 |AAH01 732 
6.2 kd protein [Homo sapiens] 


1.75E-21 




ndSHM-NPt1-F01 


hepatocyte growth factor activator [Rattus 
norvegicus] 


5.12E-17 


up-reg E2 


ndSHM-NPt1-F05 


Unknown 




up-reg E2 


ndSHM-NR1-F07 


complement component C9 [Paralichthys 
olivaceus] 


4.46E-34 


up-reg field 


ndSHM-NPt1-F11 


alanine-glyoxylate aminotransferase 2 
[Homo sapiens] 

>gi|1 743291 3|sp|Q9BYV1 (AGT2JHUMAN 
Alanine-glyoxylate aminotransferase 2, 
mitochondrial precursor (AGT 2) (Beta- 
alanine-pyruvate aminotransferase) (Beta- 
ALAAT II) >gi|12406973|emb|CAC24841 .1 1 
alanine-glyoxylate aminotransferase 2 
[Homo sapiens] 


3.01 E-28 




ndSHM-NPt1-G03 


KIAA1657 protein [Homo sapiens] 


8.65698 


up-reg E2 


ndSHM-NPt1-G07 


Unknown 




up-reg E2 


ndSHM-NPt1-G08 


choriogenin H [Oryzias latipes] 


3.54E-09 


up-reg E2 


ndSHM-NPt1-G11 


glucose-6-phosphatase, catalytic; Glucose- 
6-phosphatase [Rattus norvegicus] 
>gi|567864|gb|AAA74381 .1 1 glucose-6- 
phosphatase 


7.73E-09 


up-reg field 


ndSHM-NPt1-G12 


Orla C4 [Oryzias latipes] 


1.04E-36 


up-reg E2 


ndSHM-NR1-H03 


N-acetylneuraminate pyruvate lyase [Mus 
musculus] >gi|12832930|dbj|BAB22314.1 1 
unnamed protein product [Mus musculus] 
>gi|18490967|gb|AAH22734.1| RIKEN cDNA 
0610033B02 gene [Mus musculus] 
>qi|26353976|dbj|BAC40618.1| unnamed 


3.50E-17 
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protein product [Mus musculus] 






ndSHM-NPt1-H04 


apolipoprotein B - Atlantic salmon (fragment) 
>gi|854620|emb|CAA57449.1 1 apolipoprotein 
B [Salmo salar] 


1.14E-10 


up-reg field 


ndSHM-NPtl-H11 


putative aryl-CoA ligase EncN [Streptomyces 
maritimus] 


0.513537 




MALE/FEMALE 
UNSUBTRACTED 








onM-UUo 


cytochrome P450 (Ictaiurus punctatus) 


O.OOE-oO 


up-reg E2; 
field 


onM-UUZ 


unknown 




up-reg E2 


qljm dao 
onM-bUz 


retinol binding protein 4 (D. rerio) 


1 .QQb-17 




cuii\yi on7 
onM-bOY 


ribosomal protein L35 (galus) 


o nnc no 




onM-bUb 


unknown 




up-reg E2 


onlvl-b12 


Similar to 60S riboxomal protein L18A (D. 

iciiuj 


3.00E-40 




SHM-C03 


ribosomal Drotein P2 fl ounctatus^ 


3.00E-21 




SHM-C07 


C type lectin s (O. mykiss) 


2.00E-11 


dn-rea E2 


SHM-E04 


similar to 60S ribosomal protein L21 


2.00E-15 




onM-UUo 


unknown protein for MGC:64127 (D. rerio) 


c An c co 


up-reg E2 


cum cn-i 


G protein B subunit (Ambystoma tigrinum) 


2.00E-25 




onM-tOY 


precerebellin like protein (0. mykiss) 


7.00b-27 




onlvl-AUb 


AMBP protein precursor microglobulin 


o.UUc-oU 




onlvl-b02 


Natural killer eel enhancement factor (0. 

1 1 iyi\ioo j 


o nnc o << 

o.OOE-31 




SHM-C05 


unknown 




up-reg field 


SHM-B10 


Similar to ribosomal protein L10 (D. rerio) 


1.00E-28 






i mknn\A/n 

Ul ll\l IUWI 1 






0 1 1 1 VI \s\J 1 


i inknnwn 

Ul lr\l IUWI 1 






SHM1 


fnlx/pncvlatp rpHnrta^p 






Oi IIVI^. 1 


vucuuyci hi i ciiuiici 


in npnhsink 
ii i y ci lUai ii\ 


i in-rpn P0 m 
up icy c^, 

EE2, DES, 

NP, MXC 


SHM3 


vltelloaenin beta (W 


in oenbank 




SHM 


Ribosomal orotein S8 


8 00E-45 




SHM26 


chorioaenin 3 

VI 1 \mf 1 IVM Vl III 1 ^mJ 






SHM6 


Unknown 






SHM7-3 


choriogenin 2 


1 .00E-45 








ii i y ci lUdi ir\ 




9HMQ-1 

Of 1 IVI v7 1 


rihoQnmal nrotpin 1 ft 

1 lUUoUl 1 Idl [JIUICIII LO 






SHM74-1 


^-hx/HrnYV-^-mpthx/lnli itarvl-f^n A rpHi mt£icp 
o I iy ui UAy J 1 1 icu ly ly i u lai yrvun I cuuviaoc 


Q nnF-^1 


Hn-rpn F^ 
un i cy co 


Of 1 IVI I I 


i i a\ ioi ci 1 ii i 




un icy czz. , 

EE2, DES, 
NP, MXC 


SHM13-1 


Low molecular mass protein 2 


2.00E-12 




SHM14 


Unknown 






SHM22 


Unknown 






SHM23-1 


Ribosomal protein S9 like 


6.00E-71 




SHM24 


Unknown 






SHM25 


Ribosomal protein S9 like 


2.00E-45 
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SHM39 


Unknown 






SHM41 


Ubiquitin-conjugating enzyme 9 


EST match 
(putative) 


up-reg NP 




Unknown 






O l_J H A A o 

SHM43 


Unknown protein, Acession 

MUM lUtJI nnn IUOj/ 


4.00E-23 




SHM48 


Unknown 






SHM48-2 


Unknown 






onivio i o 


Ul IfM iUWI 1 








1 Inkn/^YA/n 

V-> \ IvJWl 1 






SHM62-2 


Hepatic lipase precursor 


7.00E-06 




SHM72-3 


Coagulation Factor XI 




up-reg E2; 
NP, MXC 


SHM73 


Unknown 






SHM76-2 


Alphal-microglobulin/bikunin precursor 
(AMBP) protein 


1.00E-11 


d-reg E2; 
EE2, DES, 
NP, MXC 




Estrogen receptor alpha 




up-reg E2; 
EE2, DES, 
NP, MXC, 
ES 
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LMB_RIBO L6 


LMB_Srnp D1 


LMBJ3LUT-PEROX III 


LMB_ABMP 


LMB_COMP FACTOR 
Bf/C2 




LMB# 




Ribosomal protein 
L6 


Small 

ribonucleoprotein D1 
polypeptide (16kD) 


Glutathione 
peroxidase III 


ABMP precursor 


Putative 

complement factor 
Bf/C2 




Gene ID 


LARGEMOUTH 
BASS GENES 


cn 




CO 


ro 






Number 


Sequence ID 


















SHM IKIGF-1 


SHM IK5D 

■ 


SHM IK25C 


SHMIK24E 


SHM IK7A 




LMB# 




Liver 


Liver 


Liver 


Liver 


Liver 




Gene ID 


SHEEPSHEAD 
MINNOW 
GENES 


Ol 


Ol 


Ol 
CO 


Ol 


Ol 

— x 




Number 


Sequence ID 





LMBJ.ECTIN STL2 


LMB_CYTO-C OX 1 


LMB_ZPC1 


LMBJVIYOSIN LIGHT 




LMB# 




Rhamnose binding 
lectin STL2 


Cytochrome c 
oxidase subunit I 


ZPC1 


myosin regulatory 
light chain 




Gene ID 


CD 


CO 


-si 


CD 




Number 


Sequence ID 












ndSHM-FT1»A10 


ndSHM-FT1-A09 


ndSHM-FT1-A03 


SHM IKIGF-2 




i" 
3 

00 

% 


Liver 


Liver 


Liver 


Liver 




Gene ID 


cn 

CD 


cn 

00 


_ * 

cn 


cn 

CD 




Number 


Sequence ID 



LMB_RIBO L7A 


LMB_ALDOLASE-B 


LMB_EMAP2 


[ 

i 

= 


s 

30 
* 


60S ribosomal 
protein L7A 


Aldolase b 


Echinoderm 
microtubule 
associated protein 
like 2 




:p 

3 

CD 

5 




ro 




o 




Number 


Seauence ID 












ndSHM-FT1~B03 


ndSHM-FT1-A12 


ndSHM-FT1-A11 




i" 
3 

w 

*t 


Liver 


Liver 


Liver 




Gene ID 


O) 
N> 




CD 
O 




Number 


Sequence ID 



o 

CO 



'o 
o 



m 

m 
z 
— i 



-o O 
p 3 

9 3 

(D 

13 
i— 

o 

CO 



en 



3 
Q. 
CO 



Q_ 
CO 



-n 
H 
i 



— 1 

i 

ra 



2 

CO 
CO 

5 



5 

X 
O 

z 



CO 
CO 

5 



—i 

o 
E? 
o 
3 

O" 

3" 



O 

CD 
3 
CD 



CO 



C 

3 
cr 
o 



CO 
CD 

n 
c 

o 

3 

O 
CD 



3 
Q. 
CO 



8 



Tl 

H 



03 



LMB_ALPHA-TUBULIN 


LMB_RIBO L23A 


LMBJH-ATPASE- 
SUBUNIT 

i 


LMB_RlBO L7 




LMB# 


alpha tubulin 

i 
i 


Ribosomal protein 
L23a 


H+-ATPase subunit, 
oligaomycin 
sensitivity conferring 
protein 

i 


Ribosomal protein 
L7 




Gene ID 


CD 


00 




o> 




Number 


Sequence ID 












i 

ndSHM-FT1-D06 


I 

ndSHM-FT1-C09 I 


t 

ndSHM-FT1-C04 


i 

ndSHM-FT1-C03 




r- 
S 

CO 

*fc 


Liver 


Liver 


Liver 


i 

' I i\/pr 




Gene ID 


<j) 

CD 


00 




CD 

a> 




Number 


Sequence ID 



ON 
-0 



LMB_ApoH 


LMB_NASCENT- 
POLYPEP 


LMB__VTG 


LMB_RIBO-Sa 


i 


3 

8 


Apoliporotein H 


Nascent polypeptide- 
associated complex, 
alpha polypeptide 


Vitellogenin prcursor 


40S ribosomal 
protein Sa 




Gene ID 


ro 

CO 


IO 

ro 


ro 


ro 
o 




Seauence ID 
Number 












ndSHM-FT1-E02 


ndSHM-FT1-E01 


ndSHM-FT1-D12 


ndSHM-FT1-D10 


r- 
S 

CD 
*fc 


Liver 


Liver 


Liver 


Liver 


Gene ID 


w 


ro 




o 


Seauence ID 
Numb r 



LMB_CIS-RETIN 
DEHYDRO 


LMB_FIB-B-SUBUNIT 


LMBJJNNAMED- 
PROTEIN 


LMB_SOL-CAR-25A#5 


LMBJTBT-BP 


[ 

| 


S 

33 
fc 


cis-retinol 
dehydrogenase 


Fibrinogen B subunit 


Unnamed protein 
product 


solute carrier family 
25 alpha member 5 


TBT-binding protein 




Gene ID 


00 


ro 




Ol 






Number 


Seauence ID 














ndSHM-FT1-F06 


ndSHM-FT1-F01 


ndSHM-FT1-E12 


ndSHM-FT1-E09 


ndSHM-FT1-E08 




LMB# 


Liver 


Liver 

-! 


Liver 


Liver 


Liver 




Gene ID 


"-si 

00 


-J 


o> 


Ol 


4^ 




Numb r 


Seauence ID 



LMB_ABC-TRANS 


LMBJ-DL RC 


LMB_SENES-ASSOC 
PROTEIN 




S 
H 

It 


ABC transporter 


LDL receptor 


Putative senscence- 
associated protein 




Gene ID 


CO 


o 


ro 
to 




Number 


Seauence ID 










ndSHM-FT1-F11 


ndSHM-FT1-F10 


ndSHM-FT1-F09 




LMB# 


Liver 


Liver 


Liver 




Gene ID 


00 


00 

o 


CO 




Number 


Seauence ID 



o 



LMB_ANTITHROMBIN 
III 


LMB_TFIIIA 


LMB_SERPIN-CP9 


LMB_CATHEPSIN B 




LMB# 


Antithrombin III 


Transcription factor 
IIIA (TFIIIA) 


Serpin CP9 


Cathepsin B 




Gene ID ! 


CO 
CJl 


CO 


CO 
CO 


CO 

ro 




Number 


Sequence ID 












ndSHM-FT1-G08 


ndSHM-FT1-G04 


ndSHM-FT1-G02 


ndSHM-FT1-F12 




LMB# 


Liver 


Liver 


Liver 


Liver 




Gene ID 


_x 
00 
Ol 


oo 


00 
CO 


00 

ro 




Number 


Sequence ID 



LMBJHAPTOGLOBIN 


LMB_WEE-I 


LMB_RIKEN 
1810056020 


i 

i 


3 

XI 
tfc 


Haptoglobin 


Wee I tyrosine 
kinase 


RIKEN cDNA 
1810056020 




Gene ID 


CO 
00 


CO 


CO 




Number 


Seauence ID 










ndSHM-FT1-H04 


ndSHM-FT1-H03 


ndSHM-FT1-H02 




r 
3 

CD 


Liver 


I i\/or 


Liver 




Gene ID 


00 
00 


00 


00 
O) 




Number 


Sequence ID 



■> 

T) 

o 

rn 



> £ 
z S 

if 



00 

■> 

"0 

o 
> 



CD 



> 
no 
O 

"6' 
o 

T3 

3 

«— »- 

CD 

m 



if 

O ' 



CD 
O 

c 

0) 

o 



0) 



2 > 
"0 

O 
*u 

o 

H 

m 



O 

CD 
3 
CO 



4^ 
O 



CD 



c 
3 

CD 



CO 
CD 
D 
C 

cd 

3 

O 
CD 



13 
Q. 
CO 
X 



o 



13 
CO 



X 
o 

-si 



3 

CO 
X 



X 
o 



03 



< 

CD 



< 
CD 



o 

CD 
3 
CD 

5 



CD 



CO 
O 



00 
CD 



3 

CD 



(A 
CD 
Q 
C 
CD 
3 
O 
CD 



DO 

1 

H 


LMB_ATPASE 6 


LMBJ-ECTIN 9 


LMB_ZPC4 




LMB# 


inter-alpha-trypsin 
inhibitor "ITI" 


ATPase subunit 6 


C-type lectin 
superfamily 9 


ZPC4 




Gene ID 


en 




CO 


4^ 
ro 




Number 


Sequence ID 












ndSHM-MC1-A03 


ndSHM-MC1-A02 


ndSHM-FT1-H12 


ndSHM-FT1-H11 




LMB# 


Liver 


Liver 


Liver 


Liver 




Gene ID 


CO 

cn 


_k 

CO 


CO 
W 


CO 

ro 




Number 


Sequence ID 



4^ 



LMB_PTTP 


LMB_HEPCIDIN 


LMB_EIF-3#7 




LMB# 


Pituitary tumor 
transforming protein 


Hepcidin precursor 


Eukaryotic 
translation initiation 
factor 3 subunit 7 




Gene ID 


00 




CD 




Number 


Sequence ID 










• 

ndSHM-MC1-A07 


ndSHM-MC1-A05 


_ JO 1_J R A Ml/"* 4 AA/ 

naSHM-MC1-A04 




LMB# 


Liver 


Liver 


Liver 




Gene ID 


_a 

CO 
00 


—i 

CO 


CO 
CD 




Number 


Sequence ID 



'o 

D 
O 

ro 
i 



o 
ro 

i 

3 



Si 

Q 

o 

— I 

o 

3J 



< o 

= o 

0) 
CO 

c_ 

0) 

o' 
5T 

a 

o 



en 



cn 
o 



Q_ 
CO 



o 

I 

CD 
O 



3 
Q_ 
CO 



o 

> 



< 

CD 



O 



ro 
o 
o 



'H 

O 
X 



CD 



H 
o 

X 

3* 



O 
cd 

CD 



CD 



z 

C 

3 

CD 



C/) 
CD 
Q 
C 
CD 
3 
O 
0 



3 
Q. 
CO 



o 

t 

o 

CD 



□3 



o 

CD 
CD 



z o 



CD 
CD 



c 

3 

or 

CD 



LMB_KALLIKREIN 


LMB_L10A 


LMB_CYTO-C OX I 


LMB_WARM-WATER 
ACC PROTEIN 




LMB# 


Kallikrein 


60S ribosomal 
protein L10A 


Cytochrome c 
oxidase subunit II 


Warm water 

acclimation-related 

protein 




Gene ID 


cn 
cn 


cn 


cn 

CO 


cn 




Number 


Sequence ID 












i 

ndSHM-MC1-B08 


ndSHM-MC1-B05 


ndSHM-MC1-B04 


ndSHM-MC1-B03 




LMB# 


Liver 


Liver 


Liver 


Liver 




Gene ID 


O 

cn 


2 


o 

00 


O 

ro 




Number 


Sequence ID 



LMB_FILAMEN-B 


LM B_H APTOG LO B 
RELATED PROT 


LMB_ALPHA-2- 
MACROGLOB-1 


LMBJDANIO EST 
3818635 


! 


r 
3 

8 


Filamen B 


Haptoglobin-related 
protein 


alpha-2- 
macroglobulin-1 


Danio EST 
IMAGE:3818635 




Gene ID 


cn 

CO 


00 


Ol 


Ol 




Number 


Seauence ID 












ndSHM-MC1-C04 


ndSHM-MC1-C02 


ndSHM-MC1-B11 


ndSHM-MC1-B10 




LMB# 


Liver 


Liver 


Liver 


Liver 




Gene ID 


ro 
o 

CO 


ro 
o 

00 


ro 
o 


N> 
O 

cn 




Number 


Seauence ID 



oo 



LMBJTATA BOX BP 


LMB_MITOCHON-ATP- 
SYNTHASE 


LMB_RXR INTERACT 
PROT 


LMB_UBIQUITIN 




r 
3 

Da 
a= 




TATA-box binding 
protein 


ATP synthase alpha 
chain mitochondrial 
precursor 
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Other Embodiments 

It is to be understood that while the invention has been described in conjunction 
with the detailed description thereof, the foregoing description is intended to illustrate 
and not limit the scope of the invention, which is defined by the scope of the appended 
5 claims. Other aspects, advantages, and modifications are within the scope of the 
following claims. 

What is claimed is: 
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